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Title: US-09-856-681A-2 
Perfect score: 5450 

Sequence: 1 MRSEALLLYFTLLHFAGAGF PPKPSFAPLSTSMKPNDACT 1030 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1586107 seqs, 282547505 residues 

Total number of hits satisfying chosen parameters: 1586107 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : A__Geneseq_29 Jan04 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6: geneseqp2003as : * 

7: geneseqp2003bs : * 

8 : geneseqp2004s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAY71460 

ID AAY71460 standard; protein; 1030 AA. 
XX 

AC AAY714 60; 
XX 

DT 04-OCT-2000 (first entry) 
XX 

DE Human semaphorin 6A-1. 
XX 

KW Human; semaphorin 6A-1; (HSA) SEMA6A-1 ; neuronal development; apoptosis; 

KW neuronal regeneration; Ena/VASP protein family; immunomodulatory; 

KW gene therapy; diagnostic agent; therapeutic agent; differentiation; 

KW cytoskeletal stabilisation; plasticity. 
XX 

OS Homo sapiens . 



XX 

FH Key 

FT Binding-site 
FT 
FT 

FT Binding-site 
FT 
FT 

FT Binding-site 
FT 
FT 
XX 

PN WO200031252-A1 
XX 

PD 02-JUN-2000. 
XX 

PF 26-NOV-1999; 99WO-EP009215 . 
XX 

PR 26-NOV-1998; 98EP-0012244 1 . 
XX 

PA (PLAC ) MAX PLANCK GES FOERDERUNG WISSENSCHAFTEN . 
XX 

PI Behl C, Klostermann A; 
XX 

DR WPI; 2000-400065/34. 

DR N-PSDB; AAD01233. 
XX 

PT Nucleic acid coding for human semaphorin 6A-1 used as diagnostic agent, 

PT therapeutic agent, for modulating immune system, in gene therapy or for 

PT effecting differentiation, cytoskeletal stabilization and/or plasticity. 
XX 

PS Example 1; Page 29-33; 53pp; English. 
XX 

CC The present sequence is a transmembranous human semaphorin 6A-1 

CC ( (HSA) SEMA6A-1 ) which is involved in neuronal development and 

CC regeneration mechanisms during apoptosis. Semaphorin is a family of 

CC proteins displaying secreted or transmembrane-based repulsive guidance 

CC cues critically involved in neuronal development. The present sequence 

CC was isolated from human 1-ZAP Express cDNA library which was screened 

CC using a PCR fragment amplified from human neuroblastoma cell line SK-N-MC 

CC cDNA. The (HSA) SEMA6A-1 protein contains a Zyxin-like domain that 

CC selectively binds to members of Ena/VASP protein family especially Evl . 

CC Expression of (HSA) SEMA6A-1 is highest in embryonic brain and kidney and 

CC moderate in lung. The present sequence is useful as diagnostic and 

CC therapeutic agents, for modulating the immune system, in gene therapy, 

CC for effecting differentiation, cytoskeletal stabilisation and plasticity 

XX 

SQ Sequence 1030 AA; 

Query Match 100.0%; Score 5450; DB 3; Length 1030; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1030; Conservative 0; Mismatches- 0; Indels 0; Gaps 0; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I I I I I I I I I I I I 1 I II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 



Location/ Qualifiers 
957. .961 

Vnote= "Specific binding motif for members of Ena/VASP 
protein family, especially Evl" 
959. .1030 

/note= "Zyxin-like domain that selectively binds to 
members of Ena/VASP protein family, especially Evl" 
1009. .1014 

/note= "Specific binding motif for members of Ena/VASP 
protein family, especially Evl" 



Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I II I I I I I I I I I I I I I I I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSI FNRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVI LAFV 660 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RES YLKGHDQLVPVTLLAI AVI LAFV 660 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I I I I II I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I 
Db 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 



Db 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

Qy 961 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 1020 

Qy 1021 TSMKPNDACT 1030 

I I I I I I I I I I 
Db 1021 TSMKPNDACT 1030 

RESULT 2 
ADA23362 



ID ADA23362 standard; protein; 1047 AA. 
XX 

AC ADA23362; 
XX 

DT 20-NOV-2003 (first entry) 
XX 

DE Human SECX polypeptide, SEC15. 
XX 

KW Human; secreted polypeptide; membrane-associated polypeptide; SECX; SEC1; 

KW SEC2; SEC3; SEC4; SEC5; SEC6; SEC7; SEC8; SEC 9 ; SEC10; SEC11; SEC12; 

KW SEC13; SEC14; SEC15; SECX-associated disorder; lung cancer; 

KW cardiovascular disease; oncology disease; immune disorder; 

KW autoimmune disease; transplant rejection; allergy; AIDS; infections; 

KW inflammatory disorder; arthritis; haematopoietic disorder; skin disorder; 

KW atherosclerosis; restenosis; neurological disease; Alzheimer's disease; 

KW trauma; wounds; spinal cord injury; skeletal disorder; cytostatic; 

KW antiinflammatory; immunosuppressive; anti-HIV; antiarthritic; 

KW antiarteriosclerotic; cardiant; neuroprotective; nootropic; vulnerary; 

KW antiallergic; cardiant; dermatological . 

XX 

OS Homo sapiens. 
XX 

PN US2003054514-A1. 
XX 

PD 20-MAR-2003. 
XX 

PF 19-SEP-2001; 2001US-00957187 . 
XX 

PR 09-MAR-1999; 99US-0123667P . 

PR 04-JAN-2000; 2000US-01744 85P . 

PR 08-MAR-2000; 2000US-0052078 1 . 

PR 19-SEP-2000; 2000US-0233798P . 

PR 20-SEP-2000; 2000US-0234082P . 
XX 

PA (SHIM/) SHIMKETS R A. 

PA (LARO/) LAROCHELLE W J. 
XX 

PI Shimkets RA, Larochelle WJ; 
XX 

DR WPI; 2003-540616/51. 

DR N-PSDB; ADA23361. 
XX 

PT New SECX nucleic acids, useful for treating or diagnosing a disorder 



PT e.g., lung cancer, cardiovascular and oncology diseases, immune disorder, 

PT and autoimmune disease. 

XX 

PS Claim 12; -Page 14; 118pp; English. 
XX 

CC The present invention relates to the isolation of human secreted or 

CC membrane-associated (SECX) polypeptides designated SEC1-SEC15, and the 

CC polynucleotide sequences encoding them. Also disclosed is a method for 

CC screening for a modulator of activity or latency of SECX. The SECX 

CC polypeptide and polynucleotide sequences may be used for treating or 

CC preventing SECX-associated disorders such as lung cancer, cardiovascular 

CC and oncology diseases, immune disorders, autoimmune diseases, transplant 

CC rejection, allergy, AIDS, infections, inflammatory disorders, arthritis, 

CC haematopoietic disorders, skin disorders, atherosclerosis, restenosis, 

CC neurological diseases (e.g. Alzheimer's disease), trauma, wounds, spinal 

CC cord injuries, and skeletal disorders. The present sequence represents a 

CC SECX polypeptide of the invention. 
XX 

SQ Sequence 1047 AA; 

Query Match 98.6%; Score 5371.5; DB 6; Length 1047; 

Best Local Similarity 97.7%; Pred. No. 0; 

Matches 1023; Conservative 0; Mismatches 7; Indels 17; Gaps 1; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHCNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I I I I I I I I I I I I I I I II I I I I II I II I I I I I I I || I I I I I I I I I I I I I I M M 
Db 61 IMIMNGTLYIAARDHI YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYD7VKHANVALFADG 180 

M I I I I I I I I I I I I I I II I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

Qy 181 KLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I I I II I II II I I I II I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I 
Db 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I I I I I I I I I I M I I I I I M I I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVGTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M II I I 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVTNSEKCSYD 480 

I I I I I I I I I I I I I I I I II I I I I I I II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 



Qy 



481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 



Db 481 GVEDKRIMGMQLDRASSSLYVAFSTCVIKBPLGRDERHGKCKKTCIASRDPYCGWIKEGG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALN GHSSSLL 583 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

D *> 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNDISTPLPDNEMSYNTVYGHSSSLL 600 

QY 584 PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 643 

I I I I I I I I I I I I I I I I I > I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVT RESYLKGHD 660 

QY 644 QLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVT 703 

M I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I | I | | | | | | | | M I I I I I I I 
Db 661 QLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVT 720 

QY 704 KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 763 

I I I I I I I I I I I I I I I I I I I II I I || I | | | | | | | | | | | | | | | | | | | | | | | M | || M | | | 
Db 721 KLSGLFGDTWSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 780 

Qy 764 LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQ 823 

M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I | M I I I I I I I I I I I I 
Db 781 LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTCLPLRASPSHIPSVWLPITQQ 840 

QY 824 GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 883 

I I M I I I I I M I I I II I I I I II II I I I I II I I I I I I I I I || | | | | | | | | | | | | | | | | | | 
Db 841 BYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 900 

Qy 884 REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 943 

I I I I M I I I I II I I I I I I I I I I I I I I I II I I I I I I I | | M I I I I I I I I I I I I I I I I M I I 
Db 901 REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 960 

QY 944 S S H L S RNQ S FGRGDN P P PAP Q RVD S I QVH S S Q P S GQ AVT VS RQ P S LNAYN S LT R S G L KRT 1003 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 SSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVS RQPSLNAYNSLTRSGLKRT 1020 

Qy 1004 PSLKPDVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I I I I I II I I I I M I I I I I I I 
Db 1021 PSLKPDVPPKPSFAPLSTSMKPNDACT 1047 

RESULT 3 
AAB90731 

ID AAB90731 standard; protein; 975 AA. 
XX 



AC AAB90731; 
XX 

DT 07-JUN-2001 (first entry) 
XX 

DE Human CJ145_1 protein sequence SEQ ID 161. 
XX 

KW Human; secreted protein; nutrient; cytokine modulator; proliferation; 

KW differentiation; immune system modulator; tissue growth; chemotactic; 

KW haemostatic; thrombolytic; anti-inflammatory; tumour inhibition; 

KW haematopoiesis . 
XX 

OS Homo sapiens . 



PN WO200119988-A1. 
XX 

PD 22-MAR-2001. 
XX 

PF 14-SEP-2000; 2000WO-US025135 . 
XX 

PR 17-SEP-1999; 99US-00398829 . 
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Jacobs K, Mccoy JM, Lavallie ER, Collins-Racie LA, Evans C; 

PI Merberg D, Treacy M, Bowman MR, Spaulding V, Agostino MJ; 
XX 

DR WPI; 2001-244801/25. 

DR N-PSDB; AAF98469. 
XX 

PT Isolated nucleic acids encoding polypeptides, useful for modulating e.g. 

PT cytokine and cell proliferation/differentiation activity, the immune 

PT system and hematopoiesis regulating activity. 
XX 

PS Disclosure; Page 487-490; 557pp; English. 
XX 

CC Human cDNA clones represented in AAF98374 - AAF98489 encode secreted 

CC proteins AAB90667 - AAB90750. The cDNA clones are isolated from various 

CC tissue types, and may be used in the prevention, treatment and diagnosis 

CC of diseases associated with inappropriate protein expression. The 

CC polypeptides and nucleic acids may be used as nutrients or to modulate 

CC cytokine and cell prolif eration/dif f erentiation activity and may also be 

CC involved in modulation of the immune system. The cDNA sequences, 

CC proteins, their agonists and/or antagonists exhibit haematopoiesis 

CC regulating activity; tissue growth activity; activin/inhibin activity; 

CC chemotactic/chemokinetic activity; haemostatic and thrombolytic activity; 

CC receptor/ligand activity; anti-inflammatory activity; haematopoiesis 

CC activity; cadherin/tumour suppressor activity; and/or tumour inhibition 

CC activity. Included in the invention are probes represented in AAF98 4 90 - 

CC AAF98572 which are specific for the cDNA clones encoding the secreted 

CC proteins 
XX 

SQ Sequence 975 AA; 

Query Match 93.6%; Score 5103.5; DB 4; Length 975; 
Best Local Similarity 94.4%; Pred. No. 0; 

Matches 972; Conservative 0; Mismatches 3; Indels 55; Gaps 1; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYD7VKHANVALFADG 180 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGM7VRCPYDAKHANV7VLF7VDG 180 



QY 



181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 



Db 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGfCWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGP YQNHT WFLGSEKGI I LKFLARI GNSGFLNDSLFLEEMSVYNS EKCS YD 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TKIAVDTAAGPYQNHTVVFLGSEKGI I LKFLARI GNSGFLNDSLFLEEMSVYNS EKCS YD 480 

Qy 4 81 GVEDKRIMGMQLDRAS S SLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWI KEGG 540 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I 
Db 4 81 GVEDKRIMGMQLDRAS S SLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 541 ACSHLS PNSRLTFEQDI ERGNTDGLGDCHNS FVALN 576 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVI LAFV 660 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 577 GVI RES YLKGHDQLVPVTLLAIAVI LAFV 605 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I II I I I I II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I 
Db 606 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 665 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 666 AILTPLMHNGKLATPGNTAKMLIK7VDQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 725 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 726 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 785 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 786 MALEDQAATLEYKTIKEHFSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLFQTGL 845 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I 
Db 84 6 SKRLEMHHSFSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 905 

Qy 961 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 1020 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II 
Db 906 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 965 

Qy 1021 TSMKPNDACT 1030 

I I I I II I I I I 



Db 



966 TSMKPNDACT 975 



RESULT 4 
AAW64221 

ID AAW64221 standard; protein; 974 AA. 
XX 

AC AAW64221; 
XX 

DT 06-OCT-1998 (first entry) 
XX 

DE Human secreted protein from clone CJ145 1. 

XX ~~ 

KW Secreted protein; human fetal brain; nutrition; cytokine; stimulant; 

KW cell proliferation; differentiation; immune system; suppressor; ligand; 

KW regulator; hematopoiesis ; tissue growth; activin; inhibin; haemostatic; 

KW chemotaxis; chemokinetic; thrombosis; receptor; cadherin; tumour; 

KW anti-inflammatory. 
XX 

OS Homo sapiens. 
XX 

PN WO9827205-A2 . 
XX 

PD 25-JUN-1998. 
XX 

PF 17-DEC-1997; 97WO-US023330 . 
XX 

PR 18-DEC-1996; 96US-00769192 . 

PR 13-JAN-1997; 97US-00783401 . 

PR 16-DEC-1997; 97US-009 91872 . 
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Jacobs K, Mccoy JM, Lavallie ER, Racie LA, Merberg D, Treacy M; 

PI Spaulding V, Agostino MJ; 

XX 

DR WPI; 1998-362774/31. 

DR N-PSDB; AAV44295. 
XX 

PT New polynucleotides and secreted proteins - obtained from human foetal 

PT brain, human adult testes, human adult brain and human adult salivary 

PT gland cDNA libraries. 
XX 

PS Claim 17 j ; Page 71-74; HOpp; English. 
XX 

CC This sequence represents a novel secreted protein from clone CJ145_1 

CC isolated from a human fetal brain cDNA library. This protein has 

CC applications for nutritional use, cytokine and cell 

CC prolif eration/dif ferentiation activity, immune stimulating or suppressing 

CC activity, hematopoiesis regulating activity, tissue growth activity, 

CC activin/inhibin activity, chemotactic/chemokinetic activity, haemostatic 

CC and thrombotic activity, receptor/ligand activity, anti-inflammatory 

CC activity, cadherin/tumour invasion suppressor activity, tumour inhibition 

CC activity and other activities 

XX 

SQ Sequence 974 AA; 



Query Match 93.4%; Score 5088; DB 2; Length 974; 

Best Local Similarity 94.3%; Pred. No. 0; 

Matches 971; Conservative 0; Mismatches 3; Indels 56; Gaps 2; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYI7\ARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 IMIMNGTLYIAARDHIYTVDIDTSH-EEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 119 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 0 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 179 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180 KLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 239 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I II I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 24 0 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 2 99 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 300 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 359 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 360 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 419 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFIARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 420 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 479 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 480 GVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 539 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I 
Db 540 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALN 575 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVT LAFV 660 

I II II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 576 GVI RES YLKGHDQLVPVTLLAI AVI LAFV 604 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 605 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 664 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 7 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 665 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 724 



Qy 



781 



QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 725 QNL I NACT KDMP PMGS P VI PT DL P L RAS P S H I P S VWL P I TQQG YQH E YVDQ P KMS E VAQ 784 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 7 85 MALEDQAATLEYKTIKEHFSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLFQTGL 844 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 845 SKRLEMHHSFSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 904 

Qy 961 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 905 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 964 

Qy 1021 TSMKPNDACT 1030 

I I I I I I I I I I 
Db 965 TSMKPNDACT 974 



RESULT 5 


AAB23030 


ID 


AAB23030 standard; protein; 939 AA. 


XX 




AC 


AAB23030 ; 


XX 




DT 


lo-JAN— zu(J± (rirst entry) 


XX 




DE 


Human semapnorin protein-like splice variant, SECX zbb4yj^-l. 


XX 




KW 


SECX protein; human; secreted; membrane-associated; cancer; 


KW 


proliferation regulator; differentiation regulator; non-malignant tumour; 


KW 


immune disorder; autoimmune disease; transplant rejection; allergy; AIDS; 


KW 


infection; inflammatory disorder; arthritis; haematopoietic disorder; 


KW 


skin disorder; cardiovascular disorder; atherosclerosis; restenosis; 


KW 


neurological disease; Alzheimer f s disease; trauma; wounding; 


KW 


spinal cord injury; skeletal disorder; cytostatic; immunosuppressive; 


KW 


anti-HIV; antiinflammatory; antiarthritic; antiarteriosclerotic; 


KW 


neuroprotective; vulnerary; antiallergic; antimicrobial; cardiant; 


KW 


dermatological; gene therapy. 


XX 




OS 


Homo sapiens . 


XX 




PN 


WO200053742-A2. 


XX 




PD 


14-SEP-2000. 


XX 




PF 


09-MAR-2000; 2000WO-US006280 . 


XX 




PR 


09-MAR-1999; 99US-0123667 P . 


PR 


08-MAR-2000; 2000US-0520781P . 


XX 




PA 


(CURA-) CURAGEN CORP. 


XX 




PI 


Shimkets RA; 


XX 




DR 


WPI; 2000-594318/56. 



DR N-PSDB; AAA93617. 
XX 

PT Novel human membrane associated or secreted polypeptides and 

PT polynucleotides useful for diagnosis, prevention and treatment of 

PT pathological states such as cancer, immune, cardiovascular and 

PT neurological disorders. 

XX 

PS Claim 1; Fig 2; 151pp; English. 
XX 

CC Sequences AAB23029-B23048 represent human SECX proteins. The SECX 

CC proteins of the invention are either secreted or membrane-associated 

CC proteins and act as regulator of cellular proliferation and 

CC differentiation. SECX proteins or nucleotides are useful for diagnosing 

CC the presence of, or predisposition to, a disease associated with altered 

CC levels of SECX proteins and nucleotides. The SECX proteins are also 

CC useful to screen compounds that modulate SECX activity or expression. The 

CC interaction of a SECX protein with other cellular proteins may be useful 

CC to modulate the activity of a partner protein, cellular proliferation, 

CC cellular differentiation and cell survival. SECX nucleotides are useful 

CC for the recombinant expression of SECX protein, and may be used detect 

CC SECX mRNA or genetic lesions in the SECX gene. They may also be used to 

CC modulate SECX expression (e.g., using antisense oligonucleotides). SECX 

CC nucleic acid sequences are also useful for identifying a cell or tissue 

CC type in a biological sample, and in forensic biology. SECX primers or 

CC probes are useful for detecting the presence of SECX nucleotides and for 

CC screening tissue cultures for contamination. Diseases that may be treated 

CC or prevented using SECX proteins or nucleotides include cancer (e.g., 

CC colorectal carcinoma, prostate cancer), benign tumours, immune disorders 

CC (including autoimmune diseases, transplant rejection, allergies, AIDS), 

CC infections, inflammatory disorders, arthritis, haematopoietic disorders, 

CC skin disorders, cardiovascular disorders, atherosclerosis, restenosis, 

CC neurological diseases (e.g., Alzheimer's disease), trauma (e.g., surgical 

CC or traumatic wounds, spinal cord injury), and skeletal disorders 
XX 

SQ Sequence 939 AA; 

Query Match 90.3%; Score 4921; DB 3; Length 939; 

Best Local Similarity 99.6%; Pred. No. 0; 

Matches 928; Conservative 2; Mismatches 2; Indels 0; Gaps 0; 



Qy 


1 


MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 


60 




M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 




Db 


1 


MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 


60 


Qy 


61 


IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 


120 






1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 


120 


Qy 


121 


IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 


180 


Qy 


181 


KLYSATVT DFLAI DAVI YRS LGES PTLRTVKHDS KWLKEP YFVQAVD YGDYI YFFFREI A 


240 






I I I I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


KLYSATVTDFLAI DAVI YRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYI YFFFREI A 


240 



Qy 



241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 



DO 


0 Al 
Z. H. X 


wFVMTMPTAAn^PPVAnA/r^T<rMnMr , r^npvT f kotaTT ^ ft kapt WP^VPf^n^HFYFNTTLOAVT 


300 


Qy 


301 


DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 




Db 


oUX 


DVXKXrJ(jKL> V VliAl rol ir I JNbX F(jb/WO/\i DJrlXjJJX.rt.bVr 1 LiKr J\£iyJ\b r JJb iwirv r ucji\ 


360 


Qy 


361 


VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 


420 






1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


obi 


VrKF RrGLCA(=rbbbXjhKY Al oNbr FULIl XjJM r lJ\lnrLWD£jAVrDlr JNKJrWr IjKIMVKI r\Xi 


42 0 


Qy 


421 


TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 




Db 


421 


1 KXAVD1 AACjF lyWrll VVr XiLjbrjJ\QjX 1 Xif\r IlBKXCjIn oKjc XtJN Do Lin XirjiMYlb v xinoHjJW^o i u 


T U \J 


Qy 


481 


GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 


540 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 




Db 


4 o X 


CjVlijUJ\K±M(oiyiyijJJKAo j biii V/vr b 1 C VX J\ V Jr Xi^K^rjKnLjJ\Ur\.J\ 1 OX/\br\JJir I ^kjvwx jmjIjvj 


540 


Qy 


541 


ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 


600 




1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


o4 X 


AL.briJ_ibirJN oKij 1 c Hj^JJ J.iijK(ajM 1 DoXi^L'v^.rlJN b r V/vblNorlo b bXiXir b ± 1 i bL/O ±./-y^IjO i Hiors. 


600 


Qy 


601 


GGMLDWKHLLDS P D STD PLGAVS S HNHQDKKGVI RES YLKGHDQLVPVT LLAI AVI LAFV 


660 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 




Db 


oUl 


(aGMLDWKnij-LiDbFUb I JJr LCjAVbbnJNnyDJ\JAoVXK.Ejb I LrvbnuyjjVrv l J_iixrt.x/\vx ia/\r v 


\J \J \J 


Qy 


661 


MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


DOl 


MGAVr bbl 1 V YC VCDrlKKKDVAV VyKKk.tS.JljXjl nbKKtabJYLbb V 1 J\XibvjXir vjjJiybJ\urJ\irrj 




Qy 


721 


AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 


780 






1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 M : 1 1 1 1 1 1 




Db 


TOT 

/zx 


7\ t t rp T) T \/m'Kir , VT "A rpDrMTA T/TWTT T V7\ nnUUT T\T TAT DTDTTQTDTT PlfllfDF P <^ PfZ r PPFTAJFP7\T 

AILI PLMrlriGKLAI xrbJM 1 AKIYILX JSJVUynnXjUXjiAXiir 1 rtLo Lei Xjyyj\KiLi:bKkjl K.kWHjrUN 


7ft 0 


Qy 


781 


QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVVVLPITQQGYQHEYVDQPKMSEVAQ 


840 






1 1 1 1 II 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 M 




DO 


/ O X 




840 


Qy 


841 


MALbDyAAl LililKl X KhnL bbx\b JrNn(jVISIXjVnjJ>JXiUbXir c J\ V JryKiL/\oijtar r Lr.M.bXiby 1 vjXi 


900 

zf \J \J 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


841 


MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 


900 


Qy 


901 


SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQAT 932 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 




Db 


901 


SKRLEMHHS S S YGVDYKRS YPTNSLTRSHLTT 932 





RESULT 6 
ADA23281 

ID ADA23281 standard; protein; 939 AA. 
XX 

AC ADA23281; 
XX 

DT 20-NOV-2003 (first entry) 
XX 

DE Human SECX polypeptide, SEC2 . 



XX 

KW Human; secreted polypeptide; membrane-associated polypeptide; SECX; SEC1; 

KW SEC2; SEC3; SEC4; SEC5; SEC6; SEC7; S EC 8 ; SEC9; SEC10; SEC11; SEC12; 

KW SEC13; SEC14; SEC15; SECX-associated disorder; lung cancer; 

KW cardiovascular disease; oncology disease; immune disorder; 

KW autoimmune disease; transplant rejection; allergy; AIDS; infections; 

KW inflammatory disorder; arthritis; haematopoietic disorder; skin disorder; 

KW atherosclerosis; restenosis; neurological disease; Alzheimer's disease; 

KW trauma; wounds; spinal cord injury; skeletal disorder; cytostatic; 

KW antiinflammatory; immunosuppressive; anti-HIV; antiarthritic; 

KW antiarteriosclerotic; cardiant; neuroprotective; nootropic; vulnerary; 

KW antiallergic; cardiant; dermatological . 

XX 

OS Homo sapiens . 
XX 

PN US2003054514-A1. 
XX 

PD 20-MAR-2003. 
XX 

PF 19-SEP-2001; 2001US-00957187 . 
XX 

PR 09-MAR-1999; 

PR 04-JAN-2000; 

PR 08-MAR-2000; 

PR 19-SEP-2000; 

PR 20-SEP-2000; 
XX 

PA (SHIM/) SHIMKETS R A. 

PA (LARO/) LAROCHELLE W J. 
XX 

PI Shimkets RA, Larochelle WJ; 
XX 

DR WPI; 2003-540616/51. 

DR N-PSDB; ADA23280. 
XX 

PT New SECX nucleic acids, useful for treating or diagnosing a disorder 

PT e.g., lung cancer, cardiovascular and oncology diseases, immune disorder, 

PT and autoimmune disease. 

XX 

PS Claim 12; Fig 2; 118pp; English. 
XX 

CC The present invention relates to the isolation of human secreted or 

CC membrane-associated (SECX) polypeptides designated SEC1-SEC15, and the 

CC polynucleotide sequences encoding them. Also disclosed is a method for 

CC screening for a modulator of activity or latency of SECX. The SECX 

CC polypeptide and polynucleotide sequences may be used for treating or 

CC preventing SECX-associated disorders such as lung cancer, cardiovascular 

CC and oncology diseases, immune disorders, autoimmune diseases, transplant 

CC rejection, allergy, AIDS, infections, inflammatory disorders, arthritis, 

CC haematopoietic disorders, skin disorders, atherosclerosis, restenosis, 

CC neurological diseases (e.g. Alzheimer's disease), trauma, wounds, spinal 

CC cord injuries, and skeletal disorders. The present sequence represents a 

CC SECX polypeptide of the invention. 
XX 

SQ Sequence 939 AA; 

Query Match 90.3%; Score 4921; DB 6; Length 939; 



99US-0123667P. 
2000US-0174485P. 
2000US-00520781. 
2000US-0233798P. 
2000US-0234082P. 



Best Local Similarity 99.6%; Pred. No. 0; 

Matches 928; Conservative 2; Mismatches 2; Indels 0; Gaps 0; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I 1 I I I I I I I I M 
Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 1 I I I I I I 

Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 KLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I i I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 42 0 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 421 TKI AVDTAAGPYQNHT WFLGS EKGI I LKFLARI GNSGFLNDS LFLEEMSVYNSEKCS YD 480 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS RDPYCGWIKEGG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS RDPYCGWIKEGG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I 
Db 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

Qy 601 GGMLDWKHLLDS PDSTDPLGAVSSHNHQDKKGVI RES YLKGHDQLVPVTLLAI AVI LAFV 660 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II II I I I II I I I I I I I I I I I I I I I 
Db 601 GGMLDWKHLLDS PDSTDPLGAVSSHNHQDKKGVI RES YLKGHDQLVPVTLLAI AVI LAFV 660 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M 
Db 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I : I I I I I I 
Db 721 AI LTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKREPSRGTREWERN 780 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 
I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I II II I I I I I I I I II I I 



Db 781 QN L I NACT KDMP PMG S PVI P T DL P LRAS P S H I P SVWL P I TQQG YQHE YVDQ P KMS E VAQ 840 

Qy 841 MALEDQAATLEYKTIKEHLSS'KSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQAT 932 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHLTT 932 



RESULT 7 
AAB23031 

ID AAB23031 standard; protein; 884 AA. 
XX 

AC AAB23031; 
XX 

DT 16-JAN-2001 (first entry) 
XX 

DE Human semaphorin protein-like splice variant, SECX 2864933-2. 
XX 

KW SECX protein; human; secreted; membrane-associated; cancer; 

KW proliferation regulator; differentiation regulator; non-malignant tumour; 

KW immune disorder; autoimmune disease; transplant rejection; allergy; AIDS; 

KW infection; inflammatory disorder; arthritis; haematopoietic disorder; 

KW skin disorder; cardiovascular disorder; atherosclerosis; restenosis; 

KW neurological disease; Alzheimer's disease; trauma; wounding; 

KW spinal cord injury; skeletal disorder; cytostatic; immunosuppressive; 

KW anti-HIV; antiinflammatory; antiarthritic; antiarteriosclerotic; 

KW neuroprotective; vulnerary; antiallergic; antimicrobial; cardiant; 

KW dermatological ; gene therapy. 

XX 

OS Homo sapiens. 
XX 

PN WO200053742-A2 . 
XX 

PD 14-SEP-2000. 
XX 

PF 09-MAR-2000; 2000WO-US006280 . 
XX 

PR 09-MAR-1999; 99US-0123667P . 

PR 08-MAR-2000; 2000US-0520781P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Shimkets RA; 
XX 

DR WPI; 2000-594318/56. 

DR N-PSDB; AAA93618. 
XX 

PT Novel human membrane associated or secreted polypeptides and 

PT polynucleotides useful for diagnosis, prevention and treatment of 

PT pathological states such as cancer, immune, cardiovascular and 

PT neurological disorders. 

XX 

PS Claim 1; Fig 3; 151pp; English. 
XX 



CC Sequences AAB23029-B23048 represent human SECX proteins. The SECX 

CC proteins of the invention are either secreted or membrane-associated 

CC proteins and act as regulator of cellular proliferation and 

CC differentiation. SECX proteins or nucleotides are useful for diagnosing 

CC the presence of, or predisposition to, a disease associated with altered 

CC levels of SECX proteins and nucleotides. The SECX proteins are also 

CC useful to screen compounds that modulate SECX activity or expression. The 

CC interaction of a SECX protein with other cellular proteins may be useful 

CC to modulate the activity of a partner protein, cellular proliferation, 

CC cellular differentiation and cell . survival . SECX nucleotides are useful 

CC for the recombinant expression of SECX protein, and may be used detect 

CC SECX mRNA or genetic lesions in the SECX gene. They may also be used to 

CC modulate SECX expression (e.g., using antisense oligonucleotides). SECX 

CC nucleic acid sequences are also useful for identifying a cell or tissue 

CC type in a biological sample, and in forensic biology. SECX primers or 

CC probes are useful for detecting the presence of SECX nucleotides and for 

CC screening tissue cultures for contamination. Diseases that may be treated 

CC or prevented using SECX proteins or nucleotides include cancer (e.g., 

CC colorectal carcinoma, prostate cancer) , benign tumours, immune disorders 

CC (including autoimmune diseases, transplant rejection, allergies, AIDS) , 

CC infections, inflammatory disorders, arthritis, haematopoietic disorders, 

CC skin disorders, cardiovascular disorders, atherosclerosis, restenosis, 

CC neurological diseases (e.g., Alzheimer's disease), trauma (e.g., surgical 

CC or traumatic wounds, spinal cord injury) , and skeletal disorders 
XX 

SQ Sequence 884 AA; 

Query Match 84.2%; Score 4590.5; DB 3; Length 884; 

Best Local Similarity 93.7%; Pred. No. 0; 

Matches 87 3; Conservative 2; Mismatches 2; Indels 55; Gaps 1; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I 
Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVXEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVXEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 



Qy 



361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 
I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 



Db 



361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 



Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TKIAVDT7^lGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 4 81 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I 'l I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALN 576 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVILAFV 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 577 GVIRESYLKGHDQLVPVTLLAIAVILAFV 605 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I 
Db 606 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 665 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I : I I I II I 
Db 666 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKREPSRGTREWERN 725 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 726 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 785 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGWLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 786 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 845 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQAT 932 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 846 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHLTT 877 



RESULT 8 
ADA23283 

ID ADA23283 standard; protein; 884 AA. 
XX 

AC ADA23283; 
XX 

DT 20-NOV-2003 (first entry) 
XX 

DE Human SECX polypeptide, SEC3 #1. 
XX 

KW Human; secreted polypeptide; membrane-associated polypeptide; SECX; SEC1; 

KW SEC2; SEC3; SEC4; SEC5; SEC6; SEC7 ; SEC8; SEC9; SEC10; SEC11; SEC12; 

KW SEC13; SEC14; SEC15; SECX-associated disorder; lung cancer; 

KW cardiovascular disease; oncology disease; immune disorder; 

KW autoimmune disease; transplant rejection; allergy; AIDS; infections; 

KW inflammatory disorder; arthritis; haematopoietic disorder; skin disorder; 

KW atherosclerosis; restenosis; neurological disease; Alzheimer's disease; 

KW trauma; wounds; spinal cord injury; skeletal disorder; cytostatic; 



KW antiinflammatory; immunosuppressive; anti-HIV; antiarthritic; 

KW antiarteriosclerotic; cardiant; neuroprotective; nootropic; vulnerary; 

KW antiallergic; cardiant; dermatological . 
XX 

OS Homo sapiens. 
XX 

PN US2003054514-A1. 
XX 

PD 20-MAR-2 003. 
XX 

PF 19-SEP-2001; 2001US-00957187 . 
XX 

PR 09-MAR-1999; 

PR 04-JAN-2000; 

PR 08-MAR-2000; 

PR 19-SEP-2000; 

PR 20-SEP-2000; 
XX 

PA (SHIM/) SHIMKETS R A. 

PA (LARO/) LAROCHELLE W J. 
XX 

PI Shimkets RA, Larochelle WJ; 
XX 

DR WPI; 2003-540616/51. 

DR N-PSDB; ADA23282. 
XX 

PT New SECX nucleic acids, useful for treating or diagnosing a disorder 

PT e.g., lung cancer, cardiovascular and oncology diseases, immune disorder, 

PT and autoimmune disease. 

XX 

PS Claim 12; Fig 3; 118pp; English. 
XX 

CC The present invention relates to the isolation of human secreted or 

CC membrane-associated (SECX) polypeptides designated SEC1-SEC15, and the 

CC polynucleotide sequences encoding them. Also disclosed is a method for 

CC screening for a modulator of activity or latency of SECX. The SECX 

CC polypeptide and polynucleotide sequences may be used for treating or 

CC preventing SECX-associated disorders such as lung cancer, cardiovascular 

CC and oncology diseases, immune disorders, autoimmune diseases, transplant 

CC rejection, allergy, AIDS, infections, inflammatory disorders, arthritis, 

CC haematopoietic disorders, skin disorders, atherosclerosis, restenosis, 

CC neurological diseases (e.g. Alzheimer's disease), trauma, wounds, spinal 

CC cord injuries, and skeletal disorders. The present sequence represents a 

CC SECX polypeptide of the invention. 
XX 

SQ Sequence 884 AA; 

Query Match 84.2%; Score 4590.5; DB 6; Length 884; 
Best Local Similarity 93.7%; Pred. No. 0; 

Matches 873; Conservative 2; Mismatches 2; Indels 55; Gaps 1; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I 

Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 
I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I II I I 



99US-0123667P. 
2000US-0174485P. 
2000US-00520781. 
2000US-0233798P. 
2000US-0234082P. 



Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYD7VKHANVALFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

Qy 481 GVEDKRIMGMQLDRAS S S LYVAFSTCVI KVPLGRCERHGKCKKTCI AS RDP YCGWI KEGG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I 
Db 481 GVEDKRIMGMQLDRAS SSL WAFSTCVIKVPLGRCERHGKCKKTCIAS RDP YCGWI KEGG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 541 ACS HLS PNS RLTFEQDI ERGNTDGLGDCHNS FVALN 576 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVI LAFV 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 577 GVI RES YLKGHDQLVPVTLLAI AVI LAFV 605 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 606 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 665 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I : I I I I I I 
Db 666 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKREPSRGTREWERN 725 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

I I I I I I I I I II I I II I I I II II I I I I I II I I I I I I I I I I II I I I I I I I I I I I II I I I I I I 
Db 726 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 785 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I 
Db 786 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 845 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQAT 932 

I I I I I I I I I I I I I I I II I I I M I I I I I I I I 
Db 846 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHLTT 877 



RESULT 9 
AAB95139 

ID AAB95139 standard; protein; 699 AA. 
XX 

AC AAB95139; 
XX 

DT 26-JUN-2001 (first entry) 
XX 

DE Human protein sequence SEQ ID NO: 17154. 
XX 

KW Human; primer; detection; diagnosis; antisense therapy; gene therapy. 
XX 

OS Homo sapiens. 
XX 

PN EP1074617-A2. 
XX 

PD 07-FEB-2001. 
XX 

PF 28-JUL-2000; 2000EP-00116126. 
XX 

PR 29-JUL-1999; 99 JP-00248036 . 

PR 27-AUG-1999; 99 JP-003002 53 . 

PR ll-JAN-2000; 2000 JP-00118776. 

PR 02-MAY-2000; 2000 JP-001837 67 . 

PR 09-JUN-2000; 2000 JP-0024 18 99 . 
XX 

PA (HELI-) HELIX RES INST. 
XX 

PI Ota T, Isogai T, Nishikawa T, Hayashi K, Saito K, Yamamoto J; 

PI Ishii S, Sugiyama T, Wakamatsu A, Nagai K, Otsuki T; 

XX 

DR WPI; 2001-318749/34. 
XX 

PT Primer sets for synthesizing polynucleotides, particularly the 5602 full- 

PT length cDNAs defined in the specification, and for the detection and/or 

PT diagnosis of the abnormality of the proteins encoded by the full-length 

PT cDNAs . 
XX 

PS Claim 8; SEQ ID NO 17154; 2537pp + Sequence Listing; English. 
XX 

CC The present invention describes primer sets for synthesising 5602 full- 

CC length cDNAs defined in the specification. Where a primer set comprises: 

CC (a) an oligo-dT primer and an oligonucleotide complementary to the 

CC complementary strand of a polynucleotide which comprises one of the 5602 

CC nucleotide sequences defined in the specification, where the 

CC oligonucleotide comprises at least 15 nucleotides; or (b) a combination 

CC of an oligonucleotide comprising a sequence complementary to the 

CC complementary strand of a polynucleotide which comprises a 5 ! -end 

CC sequence and an oligonucleotide comprising a sequence complementary to a 

CC polynucleotide which comprises a 3' -end sequence, where the 

CC oligonucleotide comprises at least 15 nucleotides and the combination of 

CC the 5 f -end sequence/3 ' -end sequence is selected from those defined in the 

CC specification. The primer sets can be used in antisense therapy and in 

CC gene therapy. The primers are useful for synthesising polynucleotides, 

CC particularly full-length cDNAs . The primers are also useful for the 



CC detection and/or diagnosis of the abnormality of the proteins encoded by 

CC the full-length cDNAs . The primers allow obtaining of the full-length 

CC cDNAs easily without any specialised methods. AAH03166 to AAH13628 and 

CC AAH13633 to AAH18742 represent human cDNA sequences; AAB92446 to AAB95893 

CC represent human amino acid sequences; and AAH13629 to AAH13632 represent 

CC oligonucleotides, all of which are used in the exemplification of the 

CC present invention 
XX 

SQ Sequence 699 AA; 

Query Match 67.5%; Score 3677; DB 4; Length 699; 

Best Local Similarity 100.0%; Pred. No. 1.2e-307; 

Matches 699; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


332 


MLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNF 


391 






■ i ■ i i i i i i i i i i i i i i i i i i ■ i ■ i ■■ i ■ i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


MLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNF 


60 


Qy 


392 


IKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHTVVFLGSEKGIILKFL 


451 






it iiiiiiiiiiiiiiiriilillllllllltlltllllltllllllllllllll 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


I KTHPLMDEAVP S I FNRPWFLRTMVRYRLTKI AVDTAAGP YQNHTWFLGSEKGI ILKFL 


i on 

120 


Qy 


452 


ARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVP 


511 






1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


ARIGNSGFLNDSLFLEEMSWNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVP 


Ton 
loU 


Qy 


512 


LGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNS 


571 






t i i i i i ■ i i i i i i ■ t i i i i i i t i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i t 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


LGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNS 


O /I n 

24 0 


Qy 


572 


FVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKK 


631 






i i i ■ i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


FVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDwKHLLDSPDSTDPLGAVbbHNHyDKK 


t n n 
JUL) 


Qy 


632 


GVI RESYLKGHDQLVPVTLLAIAVI LAFVMGAVFSGITVYCVCDHRRKDVAVVQRKEKEL 


691 






i i i i i i i i \ i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i 
II 1 1 1 1 1 1 1 II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GVI RESYLKGHDQLVPVTLLAIAVI LAFvMGAVFSGITVYCVCDHRRKDVAVVQRKlijKbL 




Qy 


692 


THSRRGSMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLD 


751 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 




Db 


361 


THSRRGSMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIK7VDQHHLD 


420 


Qy 


752 


LTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSH 


811 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


LTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSH 


480 


Qy 


812 


IPSVWLPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLV 


871 






1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 




Db 


481 


IPSVWLPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLV 


540 


Qy 


872 


ENLDSLPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQA 


931 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


ENLDSLPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQA 


600 


Qy 


932 


TTLKRNNTNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNA 


991 






II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 




Db 


601 


TTLKRNNTNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNA 


660 



Qy 992 YNSLTRSGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I II I I II I I I I I M I I I I I I I I I I I I I I I I I I 
Db 661 YNSLTRSGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 699 



RESULT 10 
AAB23043 

ID AAB23043 standard; protein; 630 AA. 
XX 

AC AAB23043; 
XX 

DT 16-JAN-2001 (first entry) 
XX 

DE Human semaphorin protein-like splice variant, SECX pCR2 . 1-2864933 . 
XX 

KW SECX protein; human; secreted; membrane-associated; cancer; 

KW proliferation regulator; differentiation regulator; non-malignant tumour; 

KW immune disorder; autoimmune disease; transplant rejection; allergy; AIDS; 

KW infection; inflammatory disorder; arthritis; haematopoietic disorder; 

KW skin disorder; cardiovascular disorder; atherosclerosis; restenosis; 

KW neurological disease; Alzheimer's disease; trauma; wounding; 

KW spinal cord injury; skeletal disorder; cytostatic; immunosuppressive; 

KW anti-HIV; antiinflammatory; antiarthritic; antiarteriosclerotic; 

KW neuroprotective; vulnerary; antiallergic; antimicrobial; cardiant; 

KW derma tological; gene therapy. 

XX 

OS Homo sapiens. 
XX 

PN WO200053742-A2. 
XX 

PD 14-SEP-2000. 
XX 

PF 09-MAR-2000; 2000WO-US006280 . 
XX 

PR 09-MAR-1999; 99US-0123667P . 

PR 08-MAR-2000; 2000US-0520781P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Shimkets RA; 
XX 

DR WPI; 2000-594318/56. 

DR N-PSDB; AAA93630. 
XX 

PT Novel human membrane associated or secreted polypeptides and 

PT polynucleotides useful for diagnosis, prevention and treatment of 

PT pathological states such as cancer, immune, cardiovascular and 

PT neurological disorders. 

XX 

PS Claim 1; Fig 15; 151pp; English. 
XX 

CC Sequences AAB23029-B23048 represent human SECX proteins. The SECX 

CC proteins of the invention are either secreted or membrane-associated 

CC proteins and act as regulator of cellular proliferation and 

CC differentiation. SECX proteins or nucleotides are useful for diagnosing 

CC the presence of, or predisposition to, a disease associated with altered 



CC levels of SECX proteins and nucleotides. The SECX proteins are also 

CC useful to screen compounds that modulate SECX activity or expression. The 

CC interaction of a SECX protein with other cellular proteins may be useful 

CC to modulate the activity of a partner protein, cellular proliferation, 

CC cellular differentiation and cell survival. SECX nucleotides are useful 

CC for the recombinant expression of SECX protein, and may be used detect 

CC SECX mRNA or genetic lesions in the SECX gene. They may also be used to 

CC modulate SECX expression (e.g., using antisense oligonucleotides). SECX 

CC nucleic acid sequences are also useful for identifying a cell or tissue 

CC type in a biological sample, and in forensic biology. SECX primers or 

CC probes are useful for detecting the presence of SECX nucleotides and for 

CC screening tissue cultures for contamination. Diseases that may be treated 

CC or prevented using SECX proteins or nucleotides include cancer (e.g., 

CC colorectal carcinoma, prostate cancer), benign tumours, immune disorders 

CC (including autoimmune diseases, transplant rejection, allergies, AIDS), 

CC infections, inflammatory disorders, arthritis, haematopoietic disorders, 

CC skin disorders, cardiovascular disorders, atherosclerosis, restenosis, 

CC neurological diseases (e.g., Alzheimer's disease), trauma (e.g., surgical 

CC or traumatic wounds, spinal cord injury), and skeletal disorders 
XX 

SQ Sequence 630 AA; 



Query Match 61.9%; Score 3373; DB 3; Length 630; 

Best Local Similarity 99.8%; Pred. No. 1.8e-281; 

Matches 628; Conservative 1; Mismatches 0; Indels 0; Gaps 



Qy 


17 


GAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIA7VRDHI 


76 




I : M | I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II II 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 




Db 


1 


GSGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 


60 


Qy 


77 


YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 


136 




I I I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 


120 


Qy 


137 


TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAV 


196 




I I I 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 




Db 


121 


TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAV 


180 


Qy 


197 


IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 


256 




I | | | | | | | 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 




Db 


181 


IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 


240 


Qy 


257 


VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFS 


316 






1 I I I M 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFS 


300 


Qy 


317 


TPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLE 


376 




1 I I || 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 




Db 


301 


TPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLE 


360 


Qy 


377 


RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 


436 




1 I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 




Db 


361 


RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 


420 



437 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 496 

| I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I II I I I II I II 
421 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 480 



Qy 4 97 SSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQD 556 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I M 
Db 481 SSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQD 540 

Qy 557 IERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDST 616 

I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 IERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDST 600 

Qy 617 DPLGAVSSHNHQDKKGVIRESYLKGHDQL 645 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 601 DPLGAVSSHNHQDKKGVIRESYLKGHDQL 629 



RESULT 11 
ADA23307 

ID ADA23307 standard; protein; 630 AA. 
XX 

AC ADA23307; 
XX 

DT 20-NOV-2003 (first entry) 
XX 

DE Human SECX polypeptide. 
XX 

KW Human; secreted polypeptide; membrane-associated polypeptide; SECX; SEC1; 

KW SEC2; SEC3; SEC4; SEC5; SEC6; SEC7; SEC8; SEC9; SEC10; SEC11; SEC12; 

KW SEC13; SEC14; SEC15; SECX-associated disorder; lung cancer; 

KW cardiovascular disease; oncology disease; immune disorder; 

KW autoimmune disease; transplant rejection; allergy; AIDS; infections; 

KW inflammatory disorder; arthritis; haematopoietic disorder; skin disorder; 

KW atherosclerosis; restenosis; neurological disease; Alzheimer's disease; 

KW trauma; wounds; spinal cord injury; skeletal disorder; cytostatic; 

KW antiinflammatory; immunosuppressive; anti-HIV; antiarthritic; 

KW antiarteriosclerotic; cardiant; neuroprotective; nootropic; vulnerary; 

KW antiallergic; cardiant; dermatological . 

XX 

OS Homo sapiens. 
XX 

PN US2003054514-A1. 
XX 

PD 20-MAR-2003. 
XX 

PF 19-SEP-2001; 2001US-00957187 . 
XX 

PR 09-MAR-1999; 99US-0123667P . 

PR 04-JAN-2000; 2000US-0174485P . 

PR 08-MAR-2000; 2000US-00520781 . 

PR 19-SEP-2000; 2000US-0233798P . 

PR 20-SEP-2000; 2000US-0234082P . 
XX 

PA (SHIM/) SHIMKETS R A. 

PA (LARO/) LAROCHELLE W J. 
XX 

PI Shimkets RA, Larochelle WJ; 
XX 

DR WPI; 2003-540616/51. 

DR N-PSDB; ADA23306. 



XX 

PT New SECX nucleic acids, useful for treating or diagnosing a disorder 

PT e.g., lung cancer, cardiovascular and oncology diseases, immune disorder, 

PT and autoimmune disease. 

XX 

PS Claim 12; Fig 15; 118pp; English. 
XX 

CC The present invention relates to the isolation of human secreted or 

CC membrane-associated (SECX) polypeptides designated SEC1-SEC15, and the 

CC polynucleotide sequences encoding them. Also disclosed is a method for 

CC screening for a modulator of activity or latency of SECX. The SECX 

CC polypeptide and polynucleotide sequences may be used for treating or 

CC preventing SECX-associated disorders such as lung cancer, cardiovascular 

CC and oncology diseases, immune disorders, autoimmune diseases, transplant 

CC rejection, allergy, AIDS, infections, inflammatory disorders, arthritis, 

CC haematopoietic disorders, skin disorders, atherosclerosis, restenosis, 

CC neurological diseases (e.g. Alzheimer's disease), trauma, wounds, spinal 

CC cord injuries, and skeletal disorders. The present sequence represents a 

CC SECX polypeptide of the invention. 
XX 

SQ Sequence 630 AA; 

Query Match 61.9%; Score 3373; DB 6; Length 630; 

Best Local Similarity 99.8%; Pred. No. 1.8e-281; 

Matches 628; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

GAGFPEDSEPI SI SHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 7 6 
I : I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I 
GSGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 60 

YTVXHDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDD7VLFVCG 136 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 12 0 

TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAV 196 

I | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAV 180 

IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 256 

I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 24 0 

VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFS 316 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
vnKNnMGKSORVLEKOWTSFLKARLNCSVPGDSHFYFNILOAVTDVIRINGRDVVLATFS 300 



I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I 
TPYNS I PGSAVCAYDMLDI ASVFTGRFKEQKS PDSTWTPVPDERVPKPRPGCCAGS S SLE 360 

RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 436 

M I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I M II I I I I I I I I I I I I I I II II I I I 
RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 42 0 

WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 4 96 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 4 80 



QY 


17 


Db 


1 


Qy 


77 


Db 


61 


Qy 


137 


Db 


121 


Qy 


197 


Db 


181 


Qy 


257 


Db 


241 


Qy 


317 


Db 


301 


Qy 


377 


Db 


361 


Qy 


437 


Db 


421 



Qy 497 SSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQD 556 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 481 SSLYVAFSTCVI KVPLGRCERHGKCKKTCI AS RDPYCGWIKEGGACSHLSPNSRLTFEQD 540 

Qy 557 IERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDST 616 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 IERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDST 600 

Qy 617 DPLGAVSSHNHQDKKGVI RESYLKGHDQL 645 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 601 DPLGAVSSHNHQDKKGVI RESYLKGHDQL 629 



RESULT 12 
ADA23360 

ID ADA23360 standard; protein; 626 AA. 
XX 

AC ADA23360; 
XX 

DT 20-NOV-2003 (first entry) 
XX 

DE Human SECX polypeptide, SEC3 #2. 
XX 

KW Human; secreted polypeptide; membrane-associated polypeptide; SECX; SEC1; 

KW SEC2; SEC3; SEC4 ; SEC5; SEC6; SEC7; SEC8; SEC9; SEC10; SEC11; SEC12; 

KW SEC13; SEC14; SEC15; SECX-associated disorder; lung cancer; 

KW cardiovascular disease; oncology disease; immune disorder; 

KW autoimmune disease; transplant rejection; allergy; AIDS; infections; 

KW inflammatory disorder; arthritis; haematopoietic disorder; skin disorder; 

KW atherosclerosis; restenosis; neurological disease; Alzheimer's disease; 

KW trauma; wounds; spinal cord injury; skeletal disorder; cytostatic; 

KW antiinflammatory; immunosuppressive; anti-HIV; antiarthritic; 

KW antiarteriosclerotic; cardiant; neuroprotective; nootropic; vulnerary; 

KW antiallergic; cardiant; dermatological . 

XX 

OS Homo sapiens. 
XX 

PN US2003054514-A1. 
XX 

PD 20-MAR-2003. 
XX 

PF 19-SEP-2001; 2001US-00957 187 . 
XX 

PR 09-MAR-1999; 99US-0123667 P . 

PR 04-JAN-2000; 2000US-0174485P . 

PR 08-MAR-2000; 2000US-00520781 . 

PR 19-SEP-2000; 2000US-0233798P . 

PR 20-SEP-2000; 2000US-0234 082P . 
XX 

PA (SHIM/) SHIMKETS R A. 

PA (LARO/) LAROCHELLE W J. 
XX 

PI Shimkets RA, Larochelle WJ; 
XX 

DR WPI; 2003-540616/51. 

DR N-PSDB; ADA23359. 



XX 

PT New SECX nucleic acids, useful for treating or diagnosing a disorder 

PT e.g., lung cancer, cardiovascular and oncology diseases, immune disorder, 

PT and autoimmune disease. 

XX 

PS Claim 12; Page 7; 118pp; English. 
XX 

CC The present invention relates to the isolation of human secreted or 

CC membrane-associated (SECX) polypeptides designated SEC1-SEC15, and the 

CC polynucleotide sequences encoding them. Also disclosed is a method for 

CC screening for a modulator of activity or latency of SECX. The SECX 

CC polypeptide and polynucleotide sequences may be used for treating or 

CC preventing SECX-associated disorders such as lung cancer, cardiovascular 

CC and oncology diseases, immune disorders, autoimmune diseases, transplant 

CC rejection, allergy, AIDS, infections, inflammatory disorders, arthritis, 

CC haematopoietic disorders, skin disorders, atherosclerosis, restenosis, 

CC neurological diseases (e.g. Alzheimer's disease), trauma, wounds, spinal 

CC cord injuries, and skeletal disorders. The present sequence represents a 

CC SECX polypeptide of the invention. 
XX 

SQ Sequence 626 AA; 

Query Match 61.7%; Score 3362; DB 6; Length 626; 

Best Local Similarity 100.0%; Pred. No. 1.6e-280; 

Matches 626; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

GFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHIYT 7 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHIYT 60 

VDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCGTN 138 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
VDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCGTN 12 0 

AFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAVIY 198 

I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAVIY 180 

RSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQVC 258 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

RSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQVC 2 4 0 

KNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFSTP 318 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
KNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFSTP 300 

YNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERY 378 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
YNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERY 360 

ATSNEFPDDTLNFI KTHPLMDEAVPS I FNRPWFLRTMVRYRLTKIAVDTAAGP YQNHTW 438 

I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
ATSNEFPDDTLNFI KTHPLMDEAVPS I FNRPWFLRTMVRYRLTKIAVDTAAGPYQNHTVV 420 

FLGSEKGIILKFIARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSS 4 98 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

FLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSS 480 



Qy 


19 


Db 


1 


Qy 


79 


Db 


61 


Qy 


139 


Db 


121 


Qy 


199 


Db 


181 


Qy 


259 


Db 


241 


Qy 


319 


Db 


301 


Qy 


379 


Db 


361 


Qy 


439 


Db 


421 



Qy 4 99 LYVAFSTCVI KVPLGRCERHGKCKKTCIAS RDPYCGWIKEGGACSHLSPNSRLTFEQDIE 558 

I I I I I I I I I II I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I 
Db 481 LYVAFSTCVI KVPLGRCERHGKCKKTCIAS RDPYCGWIKEGGACSHLSPNSRLTFEQDIE 540 

Qy 559 RGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDP 618 

I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I 
Db 541 RGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDP 600 

Qy 619 LGAVS SHNHQDKKGVI RES YLKGHDQ 644 

I I II I I I I I I I I I I I I I M 1 I I I I I I 
Db 601 LGAVS SHNHQDKKGVI RES YLKGHDQ 626 



RESULT 13 
AAB95317 

ID AAB95317 standard; protein; 574 AA. 
XX 

AC AAB95317; 
XX 

DT 26-JUN-2001 (first entry) 
XX 

DE Human protein sequence SEQ ID NO: 17568. 
XX 

KW Human; primer; detection; diagnosis; antisense therapy; gene therapy. 
XX 

OS Homo sapiens . 
XX 

PN EP1074617-A2. 
XX 

PD 07-FEB-2001. 
XX 

PF 28-JUL-2000; 2000EP-00116126 . 
XX 

PR 29-JUL-1999; 99 JP-00248036 . 

PR 27-AUG-1999; 99 JP-00300253 . 

PR ll-JAN-2000; 2000 JP-00118776 . 

PR 02-MAY-2000; 2000 JP-00183767 . 

PR 09-JUN-2000; 2000 JP-00241899 . 
XX 

PA (HELI-) HELIX RES INST. 
XX 

PI Ota T, Isogai T, Nishikawa T, Hayashi K, Saito K, Yamamoto J; 

PI Ishii S, Sugiyama T, Wakamatsu A, Nagai K, Otsuki T; 

XX 

DR WPI; 2001-318749/34. 
XX 

PT Primer sets for synthesizing polynucleotides, particularly the 5602 full- 

PT length cDNAs defined in the specification, and for the detection and/or 

PT diagnosis of the abnormality of the proteins encoded by the full-length 

PT cDNAs . 
XX 

PS Claim 8; SEQ ID NO 17568; 2537pp + Sequence Listing; English. 
XX 

CC The present invention describes primer sets for synthesising 5602 full- 

CC length cDNAs defined in the specification. Where a primer set comprises: 

CC (a) an oligo-dT primer and an oligonucleotide complementary to the 



CC complementary strand of a polynucleotide which comprises one of the 5602 

CC nucleotide sequences defined in the specification, where the 

CC oligonucleotide comprises at least 15 nucleotides; or (b) a combination 

CC of an oligonucleotide comprising a sequence complementary to the 

CC complementary strand of a polynucleotide which comprises a 5' -end 

CC sequence and an oligonucleotide comprising a sequence complementary to a 

CC polynucleotide which comprises a 3* -end sequence, where the 

CC oligonucleotide comprises at least 15 nucleotides and the combination of 

CC the 5' -end sequence/3 1 -end sequence is selected from those defined in the 

CC specification. The primer sets can be used in antisense therapy and in 

CC gene therapy. The primers are useful for synthesising polynucleotides, 

CC particularly full-length cDNAs . The primers are also useful for the 

CC detection and/or diagnosis of the abnormality of the proteins encoded by 

CC the full-length cDNAs . The primers allow obtaining of the full-length 

CC cDNAs easily without any specialised methods. AAH03166 to AAH13628 and 

CC AAH13633 to AAH18742 represent human cDNA sequences; AAB92446 to AAB95893 

CC represent human amino acid sequences; and AAH13629 to AAH13632 represent 

CC oligonucleotides, all of which are used in the exemplification of the 

CC present invention 

XX 

SQ Sequence 574 AA; 

Query Match 54.7%; Score 2980.5; DB 4; Length 574; 

Best Local Similarity 90.7%; Pred. No. 1.2e-247; 

Matches 574; Conservative 0; Mismatches 0; Indels 59; Gaps 1; 



I I I I I I I I I I I I I I I I I I I I 

MDEAVP S I FNRPWFLRTMVR 20 



Qy 


398 


Db 


1 


Qy 


458 


Db 


21 


Qy 


518 


Db 


62 


Qy 


578 


Db 


122 


Qy 


638 


Db 


182 


Qy 


698 


Db 


242 


Qy 


758 


Db 


302 


Qy 


818 


Db 


362 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 



HSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRES 637 
I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I II I 
HSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RES 181 

YLKGHDQLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRG 697 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I II I I I I I I I I I I I I I 
YLKGHDQLVPWLIAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRG 241 

SMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPT 757 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I 
SMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPT 301 

PESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVW 817 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I 



LPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSL 877 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
LPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSL 421 



Qy 878 PPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRN 937 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M N I I I 
Db 422 PPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRN 481 

Qy 93 8 NTNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTR 997 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 482 NTNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTR 541 

Qy 998 S GLKRT P S LK P DVP P KP S FAP L S T SMKPNDACT 1030 

I || I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 542 SGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 574 



RESULT 14 
AAB94104 

ID AAB94104 standard; protein; 562 AA. 
XX 

AC AAB94104; 
XX 

DT 26-JUN-2001 (first entry) 
XX 

DE Human protein sequence SEQ ID NO: 14328. 
XX 

KW Human; primer; detection; diagnosis; antisense therapy; gene therapy. 
XX 

OS Homo sapiens. 
XX 

PN EP1074617-A2. 
XX 

PD 07-FEB-2001. 
XX 

PF 28-JUL-2000; 2000EP-00116126 . 
XX 

PR 29-JUL-1999; 99 JP-00248036. 

PR 27-AUG-1999; 99 JP-00300253 . 

PR ll-JAN-2000; 2000 JP-00118776 . 

PR 02-MAY-2000; 2000 JP-00183767 . 

PR 09-JUN-2000; 2000 JP-00241899 . 
XX 

PA (HELI-) HELIX RES INST . 
XX 

PI Ota T, Isogai T, Nishikawa T, Hayashi K, Saito K, Yamamoto J; 

PI Ishii S, Sugiyama T, Wakamatsu A, Nagai K, Otsuki T; 

XX 

DR WPI; 2001-318749/34. 
XX 

PT Primer sets for synthesizing polynucleotides, particularly the 5602 full- 

PT length cDNAs defined in the specification, and for the detection and/or 

PT diagnosis of the abnormality of the proteins encoded by the full-length 

PT cDNAs . 
XX 

PS Claim 8; SEQ ID NO 14328; 2537pp + Sequence Listing; English. 
XX 

CC The present invention describes primer sets for synthesising 5602 full- 

CC length cDNAs defined in the specification. Where a primer set comprises: 

CC (a) an oligo-dT primer and an oligonucleotide complementary to the 



CC complementary strand of a polynucleotide which comprises one of the 5602 

CC nucleotide sequences defined in the specification, where the 

CC oligonucleotide comprises at least 15 nucleotides; or (b) a combination 

CC of an oligonucleotide comprising a sequence complementary to the 

CC complementary strand of a polynucleotide which comprises a 5 '-end 

CC sequence and an oligonucleotide comprising a sequence complementary to a 

CC polynucleotide which comprises a 3 f -end sequence, where the 

CC oligonucleotide comprises at least 15 nucleotides and the combination of 

CC the 5' -end sequence/3 1 -end sequence is selected from those defined in the 

CC specification. The primer sets can be used in antisense therapy and in 

CC gene therapy. The primers are useful for synthesising polynucleotides, 

CC particularly full-length cDNAs . The primers are also useful for the 

CC detection and/or diagnosis of the abnormality of the proteins encoded by 

CC the full-length cDNAs . The primers allow obtaining of the full-length 

CC cDNAs easily without any specialised methods. AAH03166 to AAH1362 8 and 

CC AAH13633 to AAH18742 represent human cDNA sequences; AAB92446 to AAB95893 

CC represent human amino acid sequences; and AAH13629 to AAH13632 represent 

CC oligonucleotides, all of which are used in the exemplification of the 

CC present invention 

XX 

SQ Sequence 562 AA; 

Query Match 54.2%; Score 2952; DB 4; Length 562; 

Best Local Similarity 100.0%; Pred. No. 3.2e-245; 

Matches 562; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 69 MSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIAS 52 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I 
Db 1 MSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS 60 

Qy 529 RDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTT 588 

I | | | | | | | | I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M M I I I I M I I I I I I i I I I 
Db 61 RDP YCGWI KEGGACSHLS PNSRLTFEQDI ERGNTDGLGDCHNS FVALNGHS S SLLPSTTT 120 

Qy 589 SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPV 648 

1 1 1 1 1 1 1 1 1 M 1 1 ii 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 ii 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I I I I I I 

Db 121 SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPV 180 

Qy 649 TLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGL 708 

| | | | I I I I I I I I I I II II I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I 

Db 181 TLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGL 240 

Qy 709 FGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKR 768 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 FGDTQSKDPKPEAI LTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKR 300 

Q y 769 KPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSWVLPITQQGYQHE 828 

I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

D b 301 KPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHE 360 

Qy 829 YVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASL 888 

| I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASL 420 

Qy 889 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 948 

| | I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I 
Db 421 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 480 



Qy 949 RNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 1008 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 481 RNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 540 

Qy 1009 DVPPKPSFAPLSTSMKPNDACT 1030 

I II I I II I I I I I I I I II I I I I I 
Db 541 DVPPKPSFAPLSTSMKPNDACT 562 



RESULT 15 
AAM93444 

ID AAM93444 standard; protein; 562 AA. 
XX 

AC AAM93444; 
XX 

DT 06-NOV-2001 (first entry) 
XX 

DE Human polypeptide, SEQ ID NO: 3088. 
XX 

KW Human; full length cDNA; cDNA synthesis; oligo-capping . 
XX 

OS Homo sapiens. 
XX 

PN EP1130094-A2. 
XX 

PD 05-SEP-2001. 
XX 

PF 07-JUL-2000; 2000EP-00114089 . 
XX 

PR 08-JUL-1999; 99 JP-00194486 . 

PR ll-JAN-2000; 2000 JP-00118774 . 

PR 02-MAY-2000; 2000 JP-00183765 . 
XX 

PA (HELI-) HELIX RES INST. 
XX 

PI Ota T, Nishikawa T, Isogai T, Hayashi K, Ishii S, Kawai Y; 

PI Wakamatsu A, Sugiyama T, Nagai K, Kojima S, Otsuki T, Koga H; 
XX 

DR WPI; 2001-524255/58. 

DR N-PSDB; AAK94365. 
XX 

PT 830 Primers useful for synthesizing full length cDNA clones and their use 

PT in genetic manipulation. 

XX 

PS Claim 8; SEQ ID NO 3088; 1380pp + Sequence Listing; English. 
XX 

CC The invention relates to primers for synthesising full length cDNA 

CC clones. 830 cDNA molecules encoding a human protein have been isolated 

CC and nucleotide sequences of 5 1 - and 3 1 -ends of the cDNA molecules have 

CC been determined. Primers for synthesising the full length cDNA are useful 

CC for clarifying the function of the protein encoded by the cDNA. The full 

CC length clones were obtained by construction of full length enriched cDNA 

CC libraries that were synthesised by the oligo-capping method. The primers 

CC enable the production of the full length cDNA easily without any special 

CC methods. The present sequence is a polypeptide encoded by a full length 

CC human cDNA of the invention. Note: The sequence data for this patent did 



CC not form part of the printed specification, but was obtained in CD-ROM 

CC format directly from EPO 

XX 

SQ Sequence 562 AA; 

Query Match 54.1%; Score 2947; DB 4; Length 562; 

Best Local Similarity 99.8%; Pred. No. 8.7e-245; 

Matches 561; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 69 MSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS 528 

I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I II 

Db 1 MSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS 60 

Qy 529 RDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTT 588 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 61 RDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTT 120 

Qy 589 SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPV 648 

I | I I I I I I I I I I II I II II I I I I I I M I I I I I I I I I I I M I I I I I I I II 

Db 121 SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPV 180 

Qy 649 TLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGL 708 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 TLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGL 240 

Qy 709 FGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKR 768 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I II I I I I I I I I I I I I I I 
Db 241 FGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKR 300 

Qy 7 69 KPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHE 82 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I I I I 
Db 301 KPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSWVLPITQQGYQHE 360 

Qy 829 YVDQPKMS EVAQMALEDQAATLEYKT I KEHLS S KS PNHGVNLVENLDS LP PKVPQREAS L 888 

I | I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I M I I I I I I I I I I I I I I 

Db 361 YVDQPKMS EVAQMALEDQAATLEYKTI KEHLS SKSPNHGVNLVENLDSLPPKVPQREASL 420 

Qy 889 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 948 

I I I I I I I I I I I I I I I I I I I I II I II I I M I I I I I I I I I M I I I I I I I II I I I I I I I I I I I 
Db 421 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 480 

Qy 949 RNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 1008 

M I I I I I I I : I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 RNQSFGRGDSPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 540 

Qy 1009 DVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I I I I I I I I I I I I I I I 
Db 541 DVPPKPSFAPLSTSMKPNDACT 562 



Search completed: March 24, 2004, 13:14:21 
Job time : 104.009 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
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(without alignments) 
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Scoring table: BLOSUM62 
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RESULT 1 
US-09-653-274-4 

Sequence 4, Application US/09653274 
Patent No. 6635742 
GENERAL INFORMATION: 
APPLICANT : Boyle, Bryan J 
APPLICANT: Yeung, George Y 
APPLICANT: Arterburn, Matthew C 
APPLICANT: Mize, Nancy K 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Drmanac, Radoje T 

TITLE OF INVENTION: Methods and Maaterials Relating to Semaphorin-Like 
TITLE OF INVENTION: Polypeptides and Polynucleotides 
FILE REFERENCE: HYS-2 3 

CURRENT APPLICATION NUMBER: US/09/653, 274 
CURRENT FILING DATE: 2000-08-31 
PRIOR APPLICATION NUMBER: 09/491,404 



PRIOR FILING DATE: 2000-01-10 
NUMBER OF SEQ ID NOS : 13 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 4 
LENGTH: 108 6 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-653-274-4 

Query Match 43.3%; Score 2361.5; DB 4; Length 1086; 

Best Local Similarity 45.5%; Pred. No. 2.2e-215; 

Matches 512; Conservative 159; Mismatches 308; Indels 147; Gaps 30; 

Qy 1 MRSEALLLYFTLL HFAGAGFPEDSEPI S I SHGNYTKQYP VFVGHKPGRNTTQRHRLD 57 

II I I II I I I I I I : : : I : : I I I I I I : I I : I I I I I 

Db 1 MRVFLLCAYILLLMVSQLRAVSFPEDDEPLNTVDYHYSRQYPVFRG-RPSGNESQ-HRLD 58 

Qy 58 IQMIMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDEC 117 

I : : : : I I I I I I I : I I I : : : I : : I I I I I : I I I I : I I II II I I I I 

Db 59 FQLMLKIRDTLYIAGRDQVYTVNLNEMPKTEVI PNKKLTWRSRQQDRENCAMKGKHKDEC 118 

Qy 118 HNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALF 177 

I I I I I I : : I I : : I I I I I I I I I I II I : : III I : I I I : I I I I : I I : Mill 
Db 119 HNFIKVFVPRNDEMVFVCGTNAFNPMCRYYRLSTLEYDGEEISGLARCPFDARQTNVALF 178 

Qy 178 ADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFR 237 

I I I I I I I I I I I II I I I I I M I : I : I I I : I : I I I I : I I I : I : I : : I I : I : I I I I I 
Db 179 ADGKLYSATVADFLASDAVIYRSMGDGSALRTIKYDSKWIKEPHFLHAIEYGNYVYFFFR 238 

Qy 238 EIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQ 2 97 

I I I I I : I : I I I : I I I : : II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I : : I I 
Db 239 EIAVEHNNLGKAVYSRVARICKNDMGGSQRVLEKHWTSFLKARLNCSVPGDPFFYFDVLQ 2 98 

Qy 298 AVTDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVP 357 

: : I I : I : I I I I : I : I I I I I I I I I I I : I II II I I I I I I I : I I I II II 

Db 299 SITDI IQINGI PTWGVFTTQLNS I PGSAVCAFSMDDIEKVFKGRFKEQKTPDSVWTAVP 358 

Qy 358 DERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 

:::|MIIIIIII I I II : I II : I I : II I : I I II I III I : III :| II 

Db 359 EDKVPKPRPGCCAKHGLAEAYKTSIDFPDETLSFIKSHPLMDSAVPPIADEPWFTKTRVR 418 

Qy 418 YRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGF-LNDSLFLEEMSVYNSEK 476 

I I I I I : I I : I I I II I : I I : I : I I I I :: I I I I : I I I I I I : II I : II I 

Db 419 YRLTAI SVDHSAGPYQNYTVT FVGSEAGMVLKVLAK — TSPFSLNDSVLLEEIEAYNHAK 476 

Qy 477 CSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWI 536 

II : |||::: :|||: : I I I I II : I : I : : I I Mlhl I I I : I I II I I I I I I I : 

Db 477 CSAENEEDKKVISLQLDKDHHALYVAFSSCIIRIPLSRCERYGSCKKSCIASRDPYCGWL 536 

Qy 537 KEGGACSHLSPNSRLT FEQDIERGNTDGLGDCHNSFVALNGHSSSL 582 

: I : I : : I I : I I I I I I I I I II I : 
Db 537 SQ-GSCGRVTPGMLLLTEDFFAFHNHSAEGYEQDTEFGNTAHLGDCH EI 58 4 

Qy 583 LPSTTTSDSTAQEGYESRGG MLD-WKHLL 610 

I I : : I I I 1:11 ::| |: I 

Db 585 LPTSTTPD YKI FGGPTSDMEVSSS SVTTMAS I PEITPKVI DTWRPKLTS S RKFV 638 



Qy 611 — DSP DSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVILAFVMGAVF 665 

II I I I I I : III I : | : | : : I I I I I : I I 

Db 639 VQDDPNTSDFTDPLSGI PKGVRWEVQS GESNQMVHMNVLI TCVFAAFVLGAFI 691 

Qy 666 SGITVYCVCD-HRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFG DTQSKDPKPE 720 

: I : I I I I II: : hi I I I I I : I I I : I I = 

Db 692 AGVAVYCYRDMFVRKNRKI— HKDAESAQSCTDSSGSFAKLNGLFDSPVKEYQQNIDSPK 74 9 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I : I I : I I : : : I I I I I I I I I I I I I : I : : 

Db 750 LYSNLLTSRKELPPNGDTKSMVMDHRGQPPELAALPTPESTPVLHQKTLQAMKSHSEKAH 809 

Qy 7 81 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEY 829 

: I I : I I 1111111:111 I : 

Db 810 GH — GASRKETPQFFPSSPPPHSPL — SHGHIPSAIVLPNATHDYNTSFSNSNAHKAEKK 865 

Qy 830 VDQP KMS EVAQMAL EDQAAT LEYKTIKEHLSSKSPN HGVNLVE 872 

: I I I : I II : : | | : : | I : :: 

Db 866 LQNIDHPLTKSSSKRDHRRSVDSRNTL-- NDLLKHLNDPNSNPKAIMGDIQMAHQNLMLD 923 

Qy 873 NLDSL PPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSY GVDYKRSYPTNSLT 926 

: I : I I I I I I I I I I I : : I : : I I : : : : : : : I I II 

Db 924 PMGSMSEVPPKVPNREASLYSPPSTLPRNSPTKRVDVPTTPGVPMTSLERQRGYHKNSSQ 983 

Qy 927 RSHQATTLKRNNTNSSNSSHLSRNQSFGRGD-NPPPAPQRVDSIQVHSSQPSGQAVTVSR 985 

II : : : I I I I III I II II : I I I I I I : I 

Db 984 R-HSISAMPK-NLNSPNGVLLSRQPSMNRGGYMPTPTGAKVDYIQ GTPVSVHL 1034 

Qy 986 QPSLNAYNSLT RSGLKRTPSLKPDVPPKPSFAPLSTSMKP 1025 

INI: : I I I : I I I I I I I I I I I I I I I I I I I : I :: I 

Db 1035 QPSLSRQSSYTSNGTLPRTGLKRTPSLKPDVPPKPSFVPQTPSVRP 1080 



RESULT 2 
US-09-653-274-8 

Sequence 8, Application US/09653274 
Patent No. 6635742 
GENERAL INFORMATION: 
APPLICANT: Boyle, Bryan J 
APPLICANT: Yeung, George Y 
APPLICANT: Arterburn, Matthew C 
APPLICANT: Mize, Nancy K 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Drmanac, Radoje T 

TITLE OF INVENTION: Methods and Maaterials Relating to Semaphorin-Like 
TITLE OF INVENTION: Polypeptides and Polynucleotides 
FILE REFERENCE: HYS-23 

CURRENT APPLICATION NUMBER: US/ 09/ 653 , 274 
CURRENT FILING DATE: 2000-08-31 
PRIOR APPLICATION NUMBER: 09/491,404 
PRIOR FILING DATE: 2000-01-10 
NUMBER OF SEQ ID NOS : 13 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 8 
LENGTH: 1070 
TYPE: PRT 



; ORGANISM: Homo sapiens 
US-09-653-274-8 



Query Match 43.2%; Score 2354; DB 4; Length 1070; 

Best Local Similarity 45.8%; Pred. No. l.le-214; 

Matches 506; Conservative 159; Mismatches 295; Indels 144; Gaps 29; 

Qy 20 FPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHIYTV 7 9 

I I I I I I : : : I : : I I I I I I : I I : I I I I I I : : : : I II I I I I : I I I 

Db 7 FPEDDEPLNTVDYHYSRQYPVFRG-RPSGNESQ-HRLDFQLMLKIRDTLYIAGRDQVYTV 64 

Qy 8 0 DIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVXLKKNDDALFVCGTNA 139 

::: I : : II I I I : I I I I : I I I I I I I I I I I I I I I I : : I I : : I I I I I I I 

Db 65 NLNEMPKTEVIPNKKLTWRSRQQDRENCAMKGKHKDECHNFIKVFVPRNDEMVFVCGTNA 124 

Qy 14 0 FNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAVIYR 199 

III II I : : III I : I I I : I I I I : I I : I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 125 FNPMCRYYRLSTLEYDGEEISGLARCPFD7VRQTNVALFADGKLYSATVADFLASDAVIYR 184 

Qy 200 SLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQVCK 259 

|:|: I I I : I : I I I I : I I I : I : I :: I I : I : I I I I I I I I I I : I :M h lll::ll 
Db 185 SMGDGSALRTIKYDSKWIKEPHFLHAIEYGNYVYFFFREIAVEHNNLGKAVYSRVARICK 244 

Qy 260 NDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFSTPY 319 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I : : I I : : I I : I : I I I I : I : I 
Db 245 NDMGGSQRVLEKHWTSFLKARLNCSVPGDPFFYFDVLQSITDIIQINGIPTWGVFTTQL 304 

Qy 320 NSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYA 379 

I I I I I I I I I I : I II II 1111111:111 II I I ::: I I I I I I I I I I I I 

Db 305 NSIPGSAVCAFSMDDIEKVFKGRFKEQKTPDSVWTAVPEDKVPKPRPGCCAKHGLAEAYK 364 

Qy 380 TSNEFPDDTLNFIKTHP LMD EAVP S I FN R P W FL RTMVR YRLT K I AVDT AAG P YQNHT WF 4 39 

II : | | | : | I : I I I : I I I I I I I I I : III : I I I I I I I I : I I : I I I I I I : I I : I 

Db 365 TSIDFPDETLSFIKSHPLMDSAVPPIADEPWFTKTRVRYRLTAISVDHSAGPYQNYTVIF 424 

Qy 440 LGSEKGIILKFLARIGNSGF-LNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSS 498 

: I I I I : : I I I I : I I I I I I : I I I : II III: I I I : : : : I I I : : 
Db 425 VGSEAGMVLKVliAK — TSPFSLNDSVLLEEIEAYNHAKCSAENEEDKKVISLQLDKDHHA 4 82 

Qy 4 99 LYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWI KEGGACSHLS PNSRLT 552 

I I I I I I : I : I : : II I I I I : I I II : I I I I I I I I I I I : : hi : : I I 
Db 483 LYVAFSSCIIRIPLSRCERYGSCKKSCIASRDPYCGWLSQ-GSCGRVTPGMLLLTEDFFA 541 

Qy 553 FEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGG — 602 

: I I I I I I I I I I I I : | | : : I I I I : I I 

Db 542 FHNHSAEGYEQDTEFGNTAHLGDCH EILPTSTTPD YKIFGGPT 584 

Qy 603 MLD-WKHLL DSP DSTDPLGAVSSHNH 627 

: : I I : I II I I I I I : 
Db 585 SDMEVSSSSVTTMASIPEITPKVIDTWRPKLTSSRKFWQDDPNTSDFTDPLSGI 639 

Qy 628 QDKKGVI RESYLKGHDQLVPVTLLAIAVT LAFVMGAVFSGITVYCVCD-HRRKDVAWQR 686 

Ml I : | : | : : | I I I I : I I : I : I I I I I I : : 

Db 640 — P KGVRWEVQ S G E SNQMVHMNVL I T C VFAAFVL GAF I AGVAVYC Y RDMFVRKN RK I — H 695 

Qy 687 KEKELTHSRRGSMSSVTKLSGLFG DTQSKDPKPEAI LTPLMHNGKLATPGNTAKML 742 

1:1 I 1111:111 : I I : I : I I : I I : 



Db 



696 KDAESAQSCTDSSGSFAKLNGLFDSPVKEYQQNIDSPKLYSNLLTSRKELPPNGDTKSMV 755 



Qy 743 IKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTD 802 

: : I I I I I I I I I I I I I : I : : : I I : I I 

Db 756 MDHRGQPPELAALPTPESTPVLHQKTLQAMKSHSEKAHGH— GASRKETPQFFPSSPPPH 813 

Qy 803 LPLRASPSHIPSVWLPITQQGYQHEY VDQP KMSEVAQMALED 845 

1111111:111 I : : I I I : I 

Db 814 S PL- - SHGHI PSAI VLPNATHDYNTS FSNSNAHKAEKKLQNI DHPLTKSS SKRDHRRSVD 871 

Qy 846 QAATLE YKT I KEH LS S KS PN HGVNLVENLDSL PPKVPQREASLGPP 891 

|| : : I I : : I I : : : : I : I I I I I II II I I 

Db 872 SRNTL — NDLLKHLNDPNSNPKAIMGDIQMAHQNLMLDPMGSMSEVPPKVPNREASLYSP 929 

Qy 892 GASLSQTGLSKRLEMHHSSSY GVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 94 8 

: : I : : I I : : : : : : : I I II II : : : | I I I II 

Db 930 PSTLPRNS PTKRVDVPTTPGVPMTSLERQRGYHKNS SQR-HS I SAMPK-NLNS PNGVLLS 987 

Qy 949 RNQSFGRGD-NPP PAP Q RVD S I Q VH S S Q P S GQ AVT VS RQ P S LN AYN S LT RSGLK 1001 

I I || || : I I I I I I : I INI: : I I I : I I I 

Db 988 RQPSMNRGGYMPTPTGAKVDYIQ GTPVSVHLQPSLSRQSSYTSNGTLPRTGLK 1040 

Qy 1002 RTPSLKPDVPPKPSFAPLSTSMKP 1025 

I I I I I I I I I I I I I I I I : I I 
Db 1041 RTPSLKPDVPPKPSFVPQTPSVRP 1064 



RESULT 3 

US-09-077-940A-4 

; Sequence 4, Application US/09077940A 

; Patent No. 6576441 

; GENERAL INFORMATION: 

; APPLICANT: KIMURA, Toru et al . 

; TITLE OF INVENTION: NOVEL SEMAPHORIN Z AND GENE ENCODING THE SAME 

FILE REFERENCE: 0020-4426P 
; CURRENT APPLICATION NUMBER: US/09/077 , 94 OA 
; CURRENT FILING DATE: 1998-06-05 
; NUMBER OF SEQ ID NOS : 2 0 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 4 
; LENGTH: 888 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-077-940A-4 

Query Match 36.9%; Score 2013.5; DB 4; Length 888; 

Best Local Similarity 45.3%; Pred. No. 2.3e-182; 

Matches 424; Conservative 129; Mismatches 241; Indels 141; Gaps 20; 

Qy 5 ALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR — HRLDIQMIM 62 

I I I I II I I I I I hi:: M MINI III I 1 = 11 :: 

Db 12 ALLLLLLLLGGAHGLFPEDPPPLSVAPRDYLNHYPVFVGSGPGRLTPAEGADDLNIQRVL 71 

Qy 63 IMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIK 122 

: | | | : | | | : : | | : : : : I : =1111 = 1 : I : : I I II I I : I I I I : I 

Db 72 RVNRTLFIGDRDNLYRVELEPPTSTELRYQRKLTWRSNPSDINVCRMKGKQEGECRNFVK 131 



Qy 123 VLLKKNDDALEVCGTNAFNPSCRNYKMDTLEPFGDEFSGMi^RCPYDAKHANVALFADGKL 182 

III : : : I I I I I : I I I I I I II : I I I : I II I I I I I II I I I I I I I I I I : I I I 
Db 132 VLLLRDESTLFVCGSNAFNPVCANYSIDTLQPVGDNISGMARCPYDPKHANVALFSDGML 191 

Qy 183 YSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVE 242 

: : I I I I II I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : : I :: I I I I I I I I : I 
Db 192 FTATVTDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSHVYFFFREIAME 251 

Qy 243 YNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDV 302 

: I : III 111:11111:111 I I II I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 
Db 252 FN Y LE KVWS RVARVC KN DVGG S P RVL EKQWT S FLKARLN C S VP GD S H F Y FNVLQAVTGV 311 

Qy 303 IRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVP 362 

: : II I I I I I I I I 1111111111:1: : I : II I I I : I I I II : I I I I I I : : : I I 
Db 312 VSLGGRPWLAVFSTPSNS I PGSAVCAFDLTQVAAVFEGRFREQKS PES IWTPVPEDQVP 371 

Qy 363 KPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTK 422 

: I I I I I I I : I I : I I I I I I : I I I I I I I I I I I I :: I I I I I :: I :: I I : 

Db 372 RPRPGCCAAPGM — QYNASSALPDDILNFVKTHPLMDEAVPSLGHAPWILRTLMRHQLTR 429 

Qy 423 IAVDTAAGPYQNHTWFLGSEKGIILKFLAR — IGNSGFLNDSLFLEEMSVYNSEKCSYD 4 80 

: I I I I I I : I I I I I I I I I I : I I I I I II I : I I I I I -I 

Db 430 VAVDVGAGPWGNQT WFLGS EAGTVLKFLVRPNASTS GT S GLS VFLEEFET YRPDRCGRP 4 89 

Qy 481 GVED — KRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKE 538 

I : : | : : : : | | | | I II I I : : I I : I I : : : I I I I I : I I I I I I : 
Db 490 GGGETGQRLLSLELDAASGGLLAAFPRCWRVPVARCQQYSGCMKNCIGSQDPYCGWAPD 54 9 

Qy 539 GGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYE 598 

I : I I I I : I MM: : I I M II 

Db 550 -GSCIFLSPGTRAAFEQDVSGASTSGLGDC 57 8 

Qy 599 SRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVILA 658 

I : : I I : I I I II : : I 

Db 579 TGLLRASLSEDRAGLVSWLLVTSSVAA 606 

Qy 659 FVMGAVFS GI TV- YCVCDHRRKDVAVVQRKEKE — LTHS RRGSMS S VTKLS GLFGDTQS K 715 

I I : I II I I : I : I I : : : I : I I : II II : : M : : I I : : : : 

Db 607 FWGAWS G FS VGW FVG L RE RRE LA — RRKDKEAILAHGAGEAVLSVSRL GERRAQ 660 

Qy 716 DP KPEAI LTPLMHNGKLATPGNTAK-MLIKADQHHLDLTALPTPEST 761 

I 111:1 III II MM: III Mill I 

Db 661 GPGGRGGGGGGGAGVP PEAL LAP LMQNG WAKATLLQGGPHDLDSGLLPTPEQT 713 

Qy 762 PTLQQKRKP SRGSREWERNQNLINACTKDMPPMGSPVI PTDLP LRASPSH 811 

I I I I I I : I I I : I I :: I I : I : 

Db 714 P-LPQKRLPTPHPHPHALGPRAWDH GHPLLPASASSSLLLLAPAR 757 

Qy 812 I P SVWLP I TQQGYQHE YVDQPKMS EVAQMALEDQAATLE YKT I KEHLS S KS PNHGVNLV 871 

I I 1:1: IM : : : I : : 

Db 758 APEQPPAPGEPTPDGRLYAARPGRASHGDFPLTPHASPDRRRW SAPTGPLDPA 811 

Qy 872 ENLDSLP-PKVPQREASL GPPGASLSQT 898 

Mil I II I I I : I : I 

Db 812 SAADGLPRPWSPPPTGSLRRPLGPHAPPAATLRRT 846 



RESULT 4 

US-09-077-940A-2 

; Sequence 2, Application US/09077940A 

; Patent No. 6576441 

; GENERAL INFORMATION: 

; APPLICANT: KIMURA, Toru et al . 

; TITLE OF INVENTION: NOVEL SEMAP HO RIN Z AND GENE ENCODING THE SAME 
; FILE REFERENCE: 0020-4426P 

; CURRENT APPLICATION NUMBER: US/09/ 077 , 94 OA 
; CURRENT FILING DATE: 1998-06-05 
; NUMBER OF SEQ ID NOS : 20 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 
; LENGTH: 8 87 
; TYPE: PRT 

; ORGANISM: Rattus norvegicus 
US-09-077-940A-2 

Query Match 36.6%; Score 1997; DB 4; Length 887; 

Best Local Similarity 42.7%; Pred. No. 8.7e-181; 

Matches 433; Conservative 132; Mismatches 260; Indels 190; Gaps 25; 

Qy 2 RSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR — HRLDIQ 5 9 

I II II II:: I : I :: : I I I I I I I I I I I I : I I 

Db 10 RPALLFLLLLLLRVTHGLFPDEPPPLSVAPRDYLSHYPVFVGSGPGRLTPAEGAEDLNIQ 69 

Qy 60 MIMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHN 119 

: : : I I I : I I I : : I I : : : I : I : : I I I I : I : I : I I I I I I I : I I I 
Db 70 RVLRVNRTLFIGDRDNLYQVELEPSTSTELRYQRKLTWRSNPSDIDVCRMKGKQEGECRN 129 

Qy 12 0 FI WLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFAD 17 9 

I : I I I I : : : I I I I I : I I I I I I II I I I I : II I I I I I I I I I I I I I I I I I : I 
Db 130 FVKVLLLRDESTLFVCGSNAFNPICANYSMDTLQLLGDNISGMARCPYDPKHANVALFSD 189 

Qy 180 GKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREI 239 

I I : : I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : : I : : I I I I I I I 
Db 190 GMLFTATVTDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSHVYFFFREI 249 

Qy 24 0 AVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAV 2 99 

I : I : I : III I I I : I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I 

Db 250 AME FN YL E KVWS RVARVC KN DVGG S P RVL EKQWT S FL KARLN C S VP GD S H F Y FNVLQAV 309 

Qy 300 TDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDE 359 

I I : : II I : I I I I I I I I I I I I I I I I : I I : I : I I I I I : I I I I I : I Mill:: 
Db 310 TGWSLGGRPVILAVFSTPSNSIPGSAVCAFDMNQVAAVFEGRFREQKSPESIWTPVPED 369 

Qy 360 RVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYR 419 

: I I : I I I I I II : I II I I : I I I : I I I I I I I I I I I I : : I I : I I : : I : : 

Db 370 QVPRPRPGCCAAPGM— QYNASNALPDEILNFVKTHPLMDEAVPSLGHSPWIVRTLIRHQ 427 

Qy 420 LTKIAVDTAAGPYQNHTWFLGSEKGIILKFLAR — IGNSGFLNDSLFLEEMSVYNSEKC 477 

I I : : I I I I I I : I I : I I I I I I I : I I I I : II I : I I I I I : : I 

Db 428 LTRVAVDVGAGPWGNQTIVFLGSEVGTVLKFLVKPNASVSGTTGPSIFLEEFETYRPDRC 4 87 

Qy 478 SYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCG 534 

I I : | : : : : | | I I I II I I : : I I : I I : : I I I I I : I I I I I 
Db 488 GRSSSAGEWGQRLLSLELDAASGGLLAAFPRCWRVPVARCQLYSGCMKNCIGSQDPYCG 547 



Qy 535 WIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQ 594 

I : I : I II: Mill: : I I I I I I 

Db 548 WAPD-GSCIFLRPGTSATFEQDVSGASTSGLGDC 580 

Qy 595 EGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIA 654 

I : : I I Mill : 

Db 581 TGLLRASLSDDRAGLVSVNLLVTS 604 

Qy 655 VI LAFVMGAVFS GI TV- YCVCDHRRKDVAWQRKEKE — LTHSRRGSMSSVTKL 705 

: I I I : I I I I I : I : I I : : : I : I I : I I II : : I I : : I 

Db 605 SVAAFWGAWSGFSVGWFVGLRERRELA — RRKDKEAI LAHGGS EAVL S VS RLGERRGT 662 

Qy 706 — SGLFGDTQSKDPKPEAI LTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 763 

II 111:111111 I I : III I I I I I II 

Db 663 GTGGRGGAGGGPGGPPEALLAPLMQNGW TKAALLHGGPHDLDSGLLPTPEQTP- 715 

Qy 764 LQQKRKPSR GSREWERNQNLINACTK DMPPMGSPVIPTD — 802 

Mill: | | | : : | : : | : I I : I I : 

Db 716 LPQKRLPTTHPHAHALGPRAWDHSHALLSASASTSLLLLAHTRAPEQPP VPTESG 770 

Qy 803 LPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIK 856 

I II : I ||::: I : : 
Db 771 PESRLCAPRSCRASHPGDFPLTP HAS P DRRRWS APT G P LDS S SVG 816 

Qy 857 EHLSSKSPNHGVNLVENLDSLP-PKVPQREASL- GPPGASLSQTGLSKRLEMHHS 909 

I I I I I : I I I I I I : I : I I : 

Db 817 DDLPGPWSPPATSSLRRPGPHGPPTAALRRT HT 849 

Qy 910 SSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGD-NPPPAP 963 

: I : I II :: I : I I I I : III 

Db 850 FNSG EARPGGHRPRRHA PADSTHL LPCGTGERTAPPVP 887 



RESULT 5 

US-09-653-274-13 

Sequence 13, Application US/09653274 
Patent No. 6635742 
GENERAL INFORMATION: 
APPLICANT: Boyle, Bryan J 
APPLICANT: Yeung, George Y 
APPLICANT: Arterburn, Matthew C 
APPLICANT: Mize, Nancy K 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Drmanac, Radoje T 

TITLE OF INVENTION: Methods and Maaterials Relating to Semaphorin-Like 
TITLE OF INVENTION: Polypeptides and Polynucleotides 
FILE REFERENCE: HYS-23 

CURRENT APPLICATION NUMBER: US/ 09/ 653 , 274 
CURRENT FILING DATE: 2000-08-31 
PRIOR APPLICATION NUMBER: 09/491,404 
PRIOR FILING DATE: 2000-01-10 
NUMBER OF SEQ ID NOS : 13 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 13 
LENGTH: 641 



; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-653-274-13 



Query Match 35.3%; Score 1923; DB 4; Length 641; 

Best Local Similarity 55,3%; Pred. No. 5.5e-174; 

Matches 362; Conservative 101; Mismatches 118; Indels 74; Gaps 12; 

Qy 20 FPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHIYTV 7 9 

I I I I I I : : : | : : | | | | | | : | I : I | | I I I : : : : I I I I I I I : I I I 
Db 7 FPEDDEPLNTVDYHYSRQYPVFRG-RPSGNESQ-HRLDFQLMLKIRDTLYIAGRDQVYTV 64 

Qy 80 DIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCGTNA 139 

: : : I : : I I I I I : I I I I : I I I I I I I I I I I I I I I I : : I I : : I I I I I I I 

Db 65 NLNEMPKTEVIPNKKLTWRSRQQDRENCAMKGKHKDECHNFIKVFVPRNDEMVFVCGTNA 124 

Qy 140 FNPSCRNYKMDTLEPFGDEFSGM7VRCPYDAKHANVALFADGKLYSATVTDFLAIDAVIYR 199 

I I I || I :: III I : I I I : I I I I : I I : I I I II I I I I I I I I I I I I I I I I I I I I 
Db 125 FNPMCRYYRLSTLEYDGEEISGLARCPFDARQTNVALFADGKLYSATVADFLASDAVIYR 184 

Qy 2 00 SLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQVCK 259 

I : I : I I I : I : I I I I : I I I : I : I : : I I : I : I I I I I I I I I I : I : I I I : I I I : : I I 
Db 185 SMGDGSALRTIKYDSKWIKEPHFLHAIEYGNYVYFFFREIAVEHNNLGKAVYSRVARICK 244 

Qy 260 NDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFSTPY 319 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I : : I I : : I I : I : I I I I : I : I 
Db 245 NDMGGSQRVLEKHWTSFLKARLNCSVPGDPFFYFDVLQSITDIIQINGIPTWGVFTTQL 304 

Qy 320 NSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYA 379 

I I I I I I I I I I : I I I I I I I I I I I I : I I I I I I I ::: I I I I I I I I I I II 

Db 305 NSIPGSAVCAFSMDDIEKVFKGRFKEQKTPDSVWTAVPEDKVPKPRPGCCAKHGLAEAYK 364 

Qy 380 T S N E F P D DT LN F I KT H P LMD EAVP S I FN RP W FL RTMVR YRLT K I AVDT AAG P YQN HTWF 439 

II : I I I : II : I I I : I I I II III I : III : I I I I I I I I : I I : I I I I I I : I I : I 

Db 365 TSI DFPDETLS FI KSHPLMDSAVPPIADEPWFTKTRVRYRLTAI SVDHSAGPYQNYTVI F 424 

Qy 440 LGSEKGIILKFLARIGNSGF-LNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSS 498 

: I I I I : : I I I I : I I I I I I : I II : II III: I I I : : : : I I I : : 
Db 425 VGSEAGMVLKVLAK — TSPFSLNDSVLLEEIEAYNHAKCSAENEEDKKVISLQLDKDHHA 482 

Qy 4 99 LYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLT 552 

I I I I I I : I : I : : I I I II I : I I II : I I I I I I I I I I I : : 1 = 1 :: I I 

Db 4 83 LYVAFSSCIIRIPLSRCERYGSCKKSCIASRDPYCGWLSQ-GSCGRVTPGMLLLTEDFFA 541 

Qy 553 FEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGG — 602 

: I M I I I I INN : II : : I I I I : II 

Db 542 FHNHSAEGYEQDTEFGNTAHLGDCH-- EILPTSTTPD YKIFGGPT 584 

Qy 603 MLD-WKHLL DSP DSTDPLGAV 622 

: : I I : I II Mill: 

Db 585 SDMEVSSSSVTTMASIPEITPKVT DTWRPKLTSSRKFWQDDPNTSDFTDPLSGI 639 



RESULT 6 
US-09-254-594-6 

; Sequence 6, Application US/09254594 
; Patent No. 6566094 



GENERAL INFORMATION: 
APPLICANT: KIMURA, Toru 
APPLICANT: KIKUCHI, Kaoru 

TITLE OF INVENTION: NOVEL SEMAPHORIN GENE: SEMAPHORIN Y 
FILE REFERENCE: 0020-4527P 
CURRENT APPLICATION NUMBER: US/ 09/254 , 594 
CURRENT FILING DATE: 1999-05-11 
NUMBER OF SEQ ID NOS : 13 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 6 
LENGTH: 930 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/ KEY: misc_f eature 
LOCATION: ()..() 

OTHER INFORMATION: Tissue Type: Child Brain 
NAME/KEY: misc_f eature 
LOCATION: ()..() 

OTHER INFORMATION: Identification Method: P for resulting peptide 
US-09-254-594-6 

Query Match 27.9%; Score 1519.5; DB 4; Length 930; 

Best Local Similarity 35.9%; Pred. No. 3.2e-135; 

Matches 381; Conservative 135; Mismatches 356; Indels 189; Gaps 32; 

Qy 6 LLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMN 65 

I I I : I I I I I : I I : I I I I I = ' | | | : : | 

Db 13 LLLLLSLPH-TQAAFPQDPLPLLISDLQGTSPLSWFRGLEDDAVAAEL-GLDFQRFLTLN 70 

Qy 66 GTLYIAARDHIYTVDIDTSHT-EEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVL 124 

I I : I I I I I : : : I : I : : I I I I : I : I I : I : : I I I I I : I : I : I I 

Db 71 RTLLVAARDHVFSFDLQAEEEGEGLVPNKYLTWRSQ— DVENCAVRGKLTDECYNYIRVL 128 

Qy 125 LKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYS 184 

: : | | | I I : I : I I I : I : : I : I : I I I I I I I : I I : I I I : I I : I I I I 
Db 129 VPWDSQTLLACGTNS FS PVCRS YGI T S LQQEGEELSGQARCP FDATQSNVAI FAEGS LYS 188 

Qy 185 ATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYN 244 

|| M I I I I : I I I I I I II: I : I I I I I : I I : I I I I :: : M : : I I I I I I :: I I 
Db 18 9 ATAADFQASDAWYRSLGPQPPLRSAKYDSKWLREPHFVQALEHGDHVYFFFREVSVEDA 24 8 

Qy 245 TMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIR 304 

:||| I 111:111 Mill I I:: I I I I I I IIIIIIMII ll!::lll:| : 

Db 249 RLGKVQFSRVARVCKRDMGGSPRALDRHWTSFLKLRLNCSVPGDSTFYFDVLQALTGPVN 308 

Qy 305 INGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKP 364 

: : I I : I : I I I I I I I I I II : : : I I I : I I I I : I I I I M : : II I I . 
Db 309 LHGRSALFGVFTTQTNSIPGSAVCAFYLDEIERGFEGKFKEQRSLDGAWTPVSEDRVPSP 368 

Qy 365 RPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIA 424 

I I I I I I : : : | : | M I I I I ! I 1 : I I II : : : I I I I I : : I 

Db 369 RPGSCAGVGGAALFSSSRDLPDDVLTFIKAHPLLDPAVPPVTHQP-LLTLTSRALLTQVA 427 

Qy 425 VDTAAGPYQNHTVVFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVED 484 

II I I I : I I I : I I I I I : I I I I I I : : I I I : I : : I I 

Db 42 8 VDGMAGPHSNITVMFLGSNDGTVLKVLTPGGRSGG- PEPI LLEEI DAYS PARCSGKRTAQ 4 86 



Qy 485 — KRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGAC 542 

: I I : I : : I I I : I I I I I : : : I I II III I : : : I : I I : I I I I I I I I 

Db 487 TARRIIGLELDTEGHRLFVAFSGCIVYLPLSRCARHGACQRSCLASQDPYCGWHSSRG-C 545 

Qy 543 SHLSPNSRLTFEQDIERGNTDGL — GDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

: : : I I I : : III: : I : I 
Db 546 VDIRGSGGTDVDQ AGNQESMEHGDCQDG ATGSQSGPGDSAY 586 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFV 660 

III: : I I : I I : I II 

Db 587 GVRRDLPPASASRSVPI PLLLASVAAAFA 615 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRG SMSSVTKLSGL 7 08 

: I I I I : I I I I I I I I I | : | : : | | 

Db 616 LGASVSGLLVSCAC — RR AHRRRGKDIETPGLPRPLSLRSLARLHG- 659 

Qy 709 FGDTQSKDPKP— EAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQ 766 

I : I : I : I I :: II : I I I I I I I I I I 

Db 660 -GGPEPPPPSKDGDAVQTPQLYTTFLPPPEGVPPP ELACLPTPESTPELPV 709 

Qy 767 KRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSH IPSWVLPIT 821 

I : I I I I I I I I : I I : I I hi I 

Db 710 KHLRAAGD- PWEWNQNRNNA KEGPGRSRGGHAAGGPAPRVLVRP — 752 

Qy 822 QQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHL SSKSPNHGVNLVENLDS- 876 

I I : | | : : I | : I I II 

Db 753 PPPGCPGQ— AVEVTTLEELLRYLHGPQPPRKGAEPPAPLTSR 793 

Qy 877 -LPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLK 935 

III: I I I I : II:: I II 
Db 794 ALPPE — PAPALLGGPSPRPHECASPLRLDV PPEGRCASAPA 833 

Qy 936 RNNTNSSNSSHL S RNQS FGRGDN P P PAPQRVDS I QVH S S Q P S GQAVT VS RQ P S 988 

I : : I II II II III II 

Db 834 -RPALSAPAPRLGVGGGRRLPFSGHRAPPALLTRV PSGGPSRYSGGPGKHL 883 

Qy 989 — LNAYNSLTRSGLKRTPSLKPDVPPKPSF-APLSTSMKPN 1026 

I III I I : I I II II 

Db 884 LYLGRPEGYRGRALKRVDVEKPQLSLKPPLVGPSSRQAVPN 924 



RESULT 7 
US-09-254-594-3 

; Sequence 3, Application US/09254594 

; Patent No. 6566094 

; GENERAL INFORMATION : 

; APPLICANT: KIMURA, Toru 

; APPLICANT: KIKUCHI, Kaoru 

; TITLE OF INVENTION: NOVEL SEMAPHORIN GENE: SEMAPHORIN Y 

; FILE REFERENCE: 0020-4527P 

; CURRENT APPLICATION NUMBER: US/09/254 , 594 

; CURRENT FILING DATE: 1999-05-11 

; NUMBER OF SEQ ID NOS : 13 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 3 
; LENGTH: 929 



; TYPE: PRT 

; ORGANISM: Rattus norvegicus 
FEATURE : 

NAME/ KEY : misc_f eature 

LOCATION: ()..() 
; OTHER INFORMATION: Tissue Type: Brain 

NAME/KEY: misc_f eature 
; LOCATION: ()..() 

; OTHER INFORMATION: Identification Method: P for resulting peptide 
US-09-254-594-3 

Query Match " 26.2%; Score 1428.5; DB 4; Length 929; 

Best Local Similarity 34.0%; Pred. No. 1.5e-126; 

Matches 360; Conservative 147; Mismatches 355; Indels 197; Gaps 32; 

Qy 6 LLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMN 65 

III II I I : I I : I I II: : Ml:: 

Db 11 LLLLLLSLPQAQTAFPQDPIPLLTSDLQGTSPSSWFRGLEDDAVAAEL-GLDFQRFLTLN 69 

Qy 66 GTLYIAARDHIYTVDIDTSHT-EEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVL 124 



Db 70 RTLLVAARDHVFSFDLQAQEEGEGLVPNKFLTWRSQ — DMENCAVRGKLTDECYNYI RVL 127 

Qy 125 LKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYS 184 

: : I I I I I : I : I I I : I : : I : I : I I I I I I I : I I : I I : I : I I I I 
Db 128 VPWDSQTLLACGTNSFSPVCRSYGITSLQQEGEELSGQARCPFDATQSTVAISAEGSLYS 187 

Qy 185 ATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYN 244 

II II I 11:11111 I II: I : I I I I I : I I : I I I : : : I I : : I I I I : : : 
Db 188 ATAADFQASDAWYRSLGPQPPLRSAKYDSKWLREPHFVYALEHGDHVYFFLPEKSL-WR 246 

Qy 24 5 T — MGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDV 302 

I : I : I I 111:111 I I I I I I I : : I I I I I I I I I I I I I I I I I I I : : I I : : I 
Db 247 TPGLGRVQFSRVARVCKRDMGGSPRALDRHWTSFLKLRLNCSVPGDSTFYFDVLQSLTGP 306 

Qy 303 IRINGRDWXATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVP 362 

: : : I I : I : I I I I I I I I I I I : : I I I I : I I I I : I I I I I I : : : II 
Db 307 VNLHGRSALFGVFTTQTNSIPGSAVCAFYLDDIERGFEGKFKEQRSLDGAWTPVSEDKVP 366 

Qy 363 KPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTK 422 

I I I I I I I : : : : I : I I I I I I I I I I : I I I I : : I I I I I : 

Db 367 SPRPGSCAGVGAAALFSSSQDLPDDVLLFIKAHPLLDPAVPPATHQP-LLTLTSRALLTQ 425 

Qy 42 3 IAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGV 482 

: I I I I I I :: I I I : I I I I I : I I I I : : I I I : I : : I I I 

Db 426 VAVDGMAGPHRNTTVLFLGSNDGTVLKVLPP-GGQSLGPEPIILEEIDAYSHARCS — GK 482 

Qy 483 ED KRIMGMQLDRAS S SLYVAFSTCVI KVPLGRCERHGKCKKTCI AS RDPYCGWI KE 538 

: I I : I : : I I I : I I I I : : : I I I I I I I : : : I : I I I I I I II : 

Db 483 RSPRAARRIIGLELDTEGHRLFVAFPGCIVYLSLSRCARHGACQRSCLASLDPYCGWHRF 542 

Qy 539 GGACSHLSPNSRLTFEQDIE-RGNTDGL — GDCHNSFVALNGHSSSLLPSTTTSDSTAQE 595 

I : I | :: | | : : | | | : : | : 

Db 543 RGCVNIRGPGG TDVDLTGNQESMEHGDCQDG ATGSQSGPGDS 584 

Qy 596 GYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAV 655 

I III: : : | : | | | 



Db 



585 AY- 



GVRRDLSPASASRSIPIPLLLACV 610 



Qy 656 ILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSK 715 

I I : I I I I : I I I II: : I : I : : I : I : I I 

Db 611 AAAFALGASVSGLLVSCAC--RRAN RRRSKDI ETPGLPRPLSLRSLARLHGGGPEP 664 

Qy 716 DPKP EAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSR 772 

II : I I I : : | | : : I I I I I I : I I I I : 

Db 665 PPPPKDGDAAQTPQLYTTFLPPPEGGSPP ELACLPTPETTPELPVKHLRAS 715 

Qy 773 GSREWERNQNLINACTKDMPPMGSPVIPTDLP LRASPSHIPSVWLPITQQ GYQ 826 

I I I I I I I I II I : I I I I I : I 

Db 716 GG-PWEWNQNGNNASEGPGRPRGCSAAGGPAPRVLVRPPPPGCPGQEVEVTTLEELLRYL 774 

Qy 827 H E YVDQPKMSEVAQMALEDQAATLEYKT I KEHLS S KS PNHGVNLV 871 

I : : I I I I Ml 

Db 775 HGPQPPRKGSEPLASAPFTSRPPASEPGAALFVD SSPMPR 814 

Qy 872 ENLDSLPP KVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSY 920 

1:11 I : I I I : I I : I I : 
Db 815 DCVP PLRLDVP PDGKRAAP S GRPALSAPAPRLGVSG- S RRL PF 856 

Qy 921 PTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPPPAPQRVDS — IQVHSSQPSG 978 

II I : I I I I I I : I I 

Db 857 PT HRA PPGLLTRVPSGGPSRYSGGPGR 883 

Qy 979 QAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDV — PPKPS 1015 

: : I I : III : I III: II : I : 
Db 884 HLLYLGR-PDGHRGRSLKRVDVKSPLSPKPPLATPPQPA 921 



RESULT 8 

US-09-653-274-10 

Sequence 10, Application US/09653274 
Patent No. 6635742 
GENERAL INFORMATION: 
APPLICANT: Boyle, Bryan J 
APPLICANT: Yeung, George Y 
APPLICANT: Arterburn, Matthew C 
APPLICANT: Mize, Nancy K 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Drmanac, Radoje T 

TITLE OF INVENTION: Methods and Maaterials Relating to Semaphorin-Like 
TITLE OF INVENTION: Polypeptides and Polynucleotides 
FILE REFERENCE: HYS-23 

CURRENT APPLICATION NUMBER: US/ 09/ 653 , 274 
CURRENT FILING DATE: 2000-08-31 
PRIOR APPLICATION NUMBER: 09/491,404 
PRIOR FILING DATE: 2000-01-10 
NUMBER OF SEQ ID NOS : 13 
SOFTWARE : Patentln Ver. 2.1 
SEQ ID NO 10 
LENGTH: 536 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-653-274-10 



Query Match 25.3%; Score 1377.5; DB 4; Length 536; 

Best Local Similarity 49.3%; Pred. No. 4.1e-122; 

Matches 265; Conservative 97; Mismatches 167; Indels 9; Gaps 



7; 



Qy 6 LLLYFTLLHFAGAGFPEDSEPI SI SHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMN 65 

I I I : I I I I I : I I : I I I I I : : I I I : : I 

Db 4 LLLLLSLPH-TQAAFPQDPLPLLISDLQGTSPLSWFRGLEDDAVAAEL-GLDFQRFLTLN 61 

Qy 66 GTLYI AARDHI YTVDI DTSHT- EEI YCSKKLTWKSRQADVDTCRMKGKHKDECHNFI KVL 124 

I I : I I I I I : : : I : I : : I I I I : I : I I : I : : I I I I I : I : I : I I 

Db 62 RTLLVAARDHVFSFDLQAEEEGEGLVPNKYLTWRSQ — DVEN C AVRGK LT D E C YN YI RVL 119 

Qy 125 LKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYS 184 

: : I I I I I : I : I I I : I : : I : I : I II I I I I : I I : I I I : I I : I I I I 
Db 120 VPWDSQTLLACGTNSFSPVCRSYGITSLQQEGEELSGQARCPFDATQSNVAIFAEGSLYS 179 

Qy 185 ATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYN 244 

II II I 111:11111 I II: | : | | M I : I I : I I I I : : : I I : : I I I I I I : : I I 

Db 180 ATAADFQASDAWYRSLGPQPPLRSAKYDSKWLREPHFVQALEHGDHVYFFFREVSVEDA 239 

Qy 245 TMGKWF P RVAQ VC KN DMGGS Q RVL E KQWT S FL KARLN C S VP GD S H F Y FN I LQAVT D VI R 304 

: I I I I I I I : I I I I I I I I I I : : II I I I I I I I I I I I I I I I I I : : I I I : I : 
Db 240 RLGKVQFSRVARVCKRDMGGSPRALDRHWTSFLKLRLNCSVPGDSTFYFDVLQALTGPVN 299 

Qy 305 INGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKP 364 

: : I I : I : I I I I I I I M I I : : : I I I : I I M : I I I I I I : : I I I I 
Db 300 LHGRSALFGVFTTQTNSIPGSAVCAFYLDEIERGFEGKFKEQRSLDGAWTPVSEDRVPSP 359 

Qy 365 RPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIA 424 

I I I I I I : : : I : I I I I I I I I I I : I I I I : : : I I I I I : : I 

Db 360 RPGSCAGVGGAALFSSSRDLPDDVLTFIKAHPLLDPAVPPVTHQP-LLTLTSRALLTQVA 418 

Qy 425 VDTAAGPYQNHTVVFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVED 4 84 

II I I I : I I I : I I I I I : I I I III : : III: I : : I I 

Db 419 VDGMAGPHSNITVMFLGSNDGTVLKVLTPGGRSGG-PEPILLEEIDAYSPARCSGKRTAQ 477 

Qy 485 — KRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

: I I : I : : I I I : I I I I I : : : I I II III I : : : I : I I : I I II I I I 

Db 478 TARRIIGLELDTEGHRLFVAFSGCIVYLPLSRCARHGACQRSCLASQDPYCGWHSSRG 535 



RESULT 9 

US-08-121-713D-58 

Sequence 58, Application US/08121713D 
Patent No. 5639856 
GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L. 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R. 
APPLICANT: O'Connor, Timothy 

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 100 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 Bush Street, Suite 3200 



; CITY: San Francisco 

STATE: CA 

COUNTRY: USA 

ZIP: 94104 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/121, 7 13D 

; FILING DATE: 13-SEP-1993 

CLASSIFICATION: 514 
; ATTORNEY/ AGENT INFORMATION: 

; NAME: Osman, Richard A. 

REGISTRATION NUMBER: 36,627 

REFERENCE/DOCKET NUMBER: B94-002-1 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415)343-4341 

TELEFAX: (415) 343-4342 

TELEX: 

; INFORMATION FOR SEQ ID NO: 58: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 730 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-08-121-713D-58 

Query Match 17.8%; Score 969.5; DB 1; Length 730; 

Best Local Similarity 34.4%; Pred. No. 5.7e-83; 

Matches 244; Conservative 124; Mismatches 224; Indels 117; Gaps 29; 
Qy 8 LYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR HRLDIQMI 61 



Db 



11 LLWVALH — AAAWVNDVSP 



I I I 

KMYVQF- 



GEERVQRFLGNESHKDHFKLL 54 



QY 



62 MIMNGTLYIAARDHIYTVDID— TSHTEEI YCSKKLTWKSRQADVDTCRMKGKHKDECHN 119 



Db 



55 EKDHNSLLVGARNIVYNISLRDLTEFTEQ RIEWHSSGAHRELCYLKGKSEDDCQN 109 



Qy 



120 FI KVLLKKNDDALFVCGTNAFNP S CRN YKMDT LEP FGD EFSGMARCPYDAKHANV 174 



Db 



110 YIRVLAKIDDDRVLICGTNAYKPLCRHYALKD GDYWEKEYEGRGLCPFDPDHNST 165 



Qy 



175 ALFADGKLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYF 234 




Db 



166 AIYSEGQLYSATVADFSGTDPLIYRG PLRTERSDLKQLNAPNFVNTMEYNDFIFF 220 



Qy 



Db 



235 FFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFN 294 

I I I I I I I I I I : : I II : I I I : I II : :: I I I I I I : I I I I I I II I I I I I 

221 FFRETAVEYINCGKAIYSRVARVCKHDKGGPHQGGDR-WTSFLKSRLNCSVPGDYPFYFN 279 



Qy 

Db 



295 I LQAVTDVI RI N-GRDV VLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPD 350 

: I : : I : I II I : I : I I I I I I I I I I I : I I I I I I I I : : : 
280 EIQSTSDI I EGNYGGQVEKLI YGVFTTPVNS I GGSAVCAFSMKS I LES FDGPFKEQETMN 339 



Qy 351 STWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPW 410 

II II : I I : I I I I I I I I :: I I : I : I I I I I I I I : I I I 

Db 340 SNWLAVPSLKVPEPRPGQCVND S RT L P D VS VN FVK S HT LMD EAVP AF FT RP I 391 

Qy 411 FLRTMVRYRLTKIAVD TAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLN 461 

:| ::|l llllll I I : : I : I : I : : I : : I I II : 

Db 392 LIRISLQYRFTKIAVDQQVRTPDG — KAYDVLFIGTDDGKVIKAL NSASFDSSDTV 445 

Qy 462 DSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRC — ERHG 519 

I I : : M : I II I : : : : I III : : : I I I : : 

Db 446 DSWI EELQVLP PGVPVKNL YWRMDGDDS KLVWS DDEI LAI KLHRCGS DKI T 499 

Qy 52 0 KCKKTCIASRDPYCGWIKEGGACSHL-SPN SRLTFEQDIERGNTDGLGDCHNSFVAL 575 

I : : I : : : I I I I I I : : I I : : I I : I I I : : 

Db 500 NCRE-CVSLQDPYCAWDNVELKCTAVGSPDWSAGKRRFIQNISLGEHKACGGRPQTEIV- 557 

Qy 57 6 NGHSSSLLPSTTTSDSTA QEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 62 8 

: I : I : I : I : I : I : I : : : II : : I I 

Db 558 AS PVPTQPTTKSSGDPVHSIHQAEFEPE 1 DNEI VIGVDDSNVI PNTLAEINHA 610 

Qy 629 DKKGVI RES YLKGHDQLVPV TL-LAI AVI LAFVMGAVFS 666 

I I : : I : I I : I I I : : : I : I : I I 

Db 611 GSK LPSSQEKLPIYTAETLTIAIVTSCLGALWGFISGFLFS 652 



RESULT 10 
US-08-835-268-58 

Sequence 58, Application US/08835268 
Patent No. 5807826 
GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L . 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R. 
APPLICANT: O'Connor, Timothy 

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 100 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 Bush Street, Suite 3200 
CITY: San Francisco 
STATE : CA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1-25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/835,268 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/08/121,713 
FILING DATE: 13-SEP-1993 
ATTORNEY/ AGENT INFORMATION: 



NAME: Osman, Richard A. 

REGISTRATION NUMBER: 36,627 
; REFERENCE/ DOCKET NUMBER: B94-002-1 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415)343-4341 

TELEFAX: (415) 343-4342 

TELEX: 

; INFORMATION FOR SEQ ID NO: 58: 

SEQUENCE CHARACTERISTICS: 
LENGTH: 730 amino acids 
; TYPE: amino acid 

; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-08-835-268-58 



Query Match 17.8%; Score 969.5; DB 1; Length 730; 

Best Local Similarity 34.4%; Pred. No. 5.7e-83; 

Matches 244; Conservative 124; Mismatches 224; Indels 117; Gaps 



Qy 


8 


LYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR HRLDIQMI 


61 


Db 


11 


1 : 1 1 1 : 1 1 1 1 1 1 II 1 : ::: 
LLWVALH— AAAWVNDVSP KMYVQF GEERVQRFLGNESHKDHFKLL 


54 


Qy 


62 


MIMNGTLYIAARDH I YTVDI D — TSHTEEI YCSKKLTWKSRQADVDTCRMKGKHKDECHN 
: : | : I 1 : : 1 : : 1 I I : : : 1 1 1 : 1 : 1 H : 1 : 1 1 
EKDHNSLLVGARNIVYNISLRDLTEFTEQ RIEWHSSGAHRELCYLKGKSEDDCQN 


119 


Db 


55 


109 


Qy 


120 


FIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGD EFSGMARCPYDAKHANV 

: I : 1 1 1 : 1 1 : : 1 1 1 1 1 : 1 1 1 : 1 : II hi 11 = 1 1 : 
YIRVLAKIDDDRVLICGTNAYKPLCRHYALKD GDYWEKEYEGRGLCPFDPDHNST 


174 


Db 


110 


165 


Ov 


175 


ALFADGKLYSATVTDFLAI DAVI YRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYF 
|::::|:| I I I 1 1 II | :||| III : 1 1 1 III ::| hhl 
AIYSEGQLYSATVADFSGTDPLIYRG PLRTERSDLKQLNAPNFVNTMEYNDFIFF 


234 


Db 


166 


220 


Qy 


235 


FFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFN 

MM MM II : : 1 1 1 : II h 1 M : : : 1 M II h II 1 1 1 1 1 II MM 
FFRETAVEYINCGKAIYSRVARVCKHDKGGPHQGGDR-WTSFLKSRLNCSVPGDYPFYFN 


294 


Db 


221 


279 


Qy 


295 


ILQAVTDVIRIN-GRDV VLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPD 

: I : : 1 : 1 I I 1 : h 1 1 1 II II 1 II h 1 1 1 1 1 1 1 h : : 
EIQSTSDIIEGNYGGQVEKLIYGVFTTPVNSIGGSAVCAFSMKSILESFDGPFKEQETMN 


350 


Db 


280 


339 


Qy 


351 


STWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPW 

|| M : 1 1 : 1 1 M 1 1 II : : 1 h h 1 II M II h 1 II 
SNWLAVPSLKVPEPRPGQCVND SRTLPDVSVNFVKSHTLMDEAVPAFFTRPI 


410 


Db 


340 


391 


Qy 


411 


FLRTMVRYRLTKIAVD TAAGP YQNHT WFLGSEKGI I LKFLARI GNS GFLN 

:| ::|| MIMI 1 1 : : I : I : 1 : : 1 : : 1 1 II : 
LIRISLQYRFTKIAVDQQVRTPDG— KAYDVLFIGTDDGKVIKAL NSASFDSSDTV 


461 


Db 


392 


445 


Qy 


462 


DSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRC— ERHG 
|| : : 1 1 : 1 II 1 : : : M III : : : 1 I I : : 
DSWI EELQVLP PGVPVKNLYWRMDGDDSKLVWSDDEILAIKLHRCGSDKIT 


519 


Db 


446 


499 


Qy 


520 


KCKKTCIAS RDPYCGWI KEGGACSHL- S PN SRLTFEQDI ERGNTDGLGDCHNS FVAL 


575 



I : : I : : : I I I I I 



Db 500 NCRE- CVS LQDP YCAWDNVELKCTAVGS PDWSAGKRRFI QN I S LGEHKACGGRPQTEI V- 557 



Qy 57 6 NGHSSSLLPSTTTSDSTA QEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 62 8 

: I : I : I : I : I : I : | : : : | I : : I I 

Db 558 AS PVPTQPTTKSSGDPVHSIHQAEFEPE 1 DNEI VI GVDDSNVI PNTLAEINHA 610 

Qy 629 DKKGVI RESYLKGHDQLVPV TL-LAI AVI LAFVMGAVFS 666 

I I : : I : I I : I I | : : : | : | : | | 

Db 611 GSK LPSSQEKLPIYTAETLTIAIVTSCLGALWGFISGFLFS 652 



RESULT 11 
US-09-060-692-58 

Sequence 58, Application US/09060692 
Patent No. 5935865 
GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L. 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R. 
APPLICANT: O'Connor, Timothy 

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 100 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 Bush Street, Suite 3200 
CITY: San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP : 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/060, 692 
FILING DATE: 
CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/08/121, 713 
FILING DATE: 13-SEP-1993 
ATTORNEY/AGENT INFORMATION: 
NAME: Osman, Richard A. 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: B94-002-1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
TELEX: 

INFORMATION FOR SEQ ID NO: 58: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 730 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-060-692-58 



Query Match 17.8%; Score 969.5; DB 2; Length 730; 

Best Local Similarity 34.4%; Pred. No. 5.7e-83; 

Matches 244; Conservative 124; Mismatches 224; Indels 117; Gaps 29; 

Qy 8 LYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR HRLDIQMI 61 

I : I I I : I I Ml I II I : ■ ■ ■ 

Db 11 LLWVALH— AAAWVNDVSP KMYVQF GEERVQRFLGNESHKDHFKLL 54 

Qy 62 MIMNGTLYIAARDHIYTVDID — TSHTEEI YCSKKLTWKSRQADVDTCRMKGKHKDECHN 119 

: : I : I I : : I : : I I I : : : I I I : I : I I I : I : I I 

Db 55 EKDHNSLLVGARNIVYNISLRDLTEFTEQ RIEWHSSGAHRELCYLKGKSEDDCQN 109 

Qy 120 FIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGD E FS GMARC P Y DAKHANV 174 

: I : I I I : I I : : I II I I : I I I : I : II I : I I I : I I = 

Db 110 YI RVLAKI DDDRVLI CGTNAYKPLCRHYALKD GDYWEKEYEGRGLCPFDPDHNST 165 

Qy 175 ALFADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYF 234 

I : : : : I : I I I I I I I I I : I I I I I I : I I I III : : I I : I : I 

Db 166 AIYSEGQLYSATVADFSGTDPLIYRG PLRTERSDLKQLNAPNFVNTMEYNDFIFF 220 

Qy 235 FFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFN 2 94 

I I I I I I I I I I : : I I I : I I I : I I I : : : I I I I I I : I I I I I I II I I I I I 
Db 221 FFRETAVE YI NCGKAI YS RVARVCKHDKGGPHQGGDR-WT S FLKS RLNCSVPGDYP FYFN 279 

Qy 295 I LQAVT DVI RI N- GRDV VLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPD 350 

: I : : I : I II I : I : I I I I I I I I I I I : I I I I I I I I : : : 
Db 280 EI QST S DI I EGN YGGQVEKLI YGVFTT PVNS I GGS AVCAFSMKS I LES FDGP FKEQETMN 339 

Qy 351 STWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPW 410 

|| || : I I : I I I I I I I I :: I I : I : I I I I I I I I : I I I 

Db 340 SNWLAVPSLKVPEPRPGQCVND SRTLPDVSVNFVKSHTLMDEAVPAFFTRPI 391 

Qy 411 FLRTMVRYRLTKIAVD TAAGP YQNHT WFLGS EKGI I LKFLARI GNS GFLN 461 

: I :: I I I I I I I I I I : : | : | : | : : I : : I I II : 

Db 392 LI RI SLQYRFTKIAVDQQVRTPDG — KAYDVLFIGTDDGKVIKAL NSASFDSSDTV 445 

Qy 462 DSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRC — ERHG 519 

I I : : I I : I II I : : : : I III : : : I I I : : 

Db 446 DSWI EELQVLP PGVPVKNLYWRMDGDDS KLVWS DDEI LAI KLHRCGS DKI T 4 99 

Qy 520 KCKKTCIASRDPYCGWIKEGGACSHL-SPN SRLTFEQDIERGNTDGLGDCHNSFVAL 575 

I : : I : : : I I I I I I : : I I : : I hi I I : : 

Db 500 NCRE-CVSLQDPYCAWDNVELKCTAVGSPDWSAGKRRFIQNISLGEHKACGGRPQTEIV- 557 

Qy 576 NGHSSSLLPSTTTSDSTA QEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 628 

:| :|: I: I: I :| :| : :: II :: II 

Db 558 ASPVPTQPTTKSSGDPVHSIHQAEFEPE IDNEIVI GVDDSNVIPNTLAEINHA 610 

Qy 629 DKKGVIRESYLKGHDQLVPV TL-LAI AVI LAFVMGAVFS 666 

I I : : I : I I : I I | : : : | : | : | | 

Db 611 GSK LPSSQEKLPIYTAETLTIAIVTSCLGALWGFISGFLFS 652 



RESULT 12 
US-08-833-391-58 

; Sequence 58, Application US/08833391 



Patent No. 6013781 
GENERAL INFORMATION : 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L. 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R. 
APPLICANT: O 1 Connor , Timothy 

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES : 100 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 Bush Street, Suite 3200 

CITY: San Francisco 

STATE: CA 

COUNTRY: USA 

ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/ 833, 391 

FILING DATE: 

CLASSIFICATION: 530 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/121,713 

FILING DATE: 13-SEP-1993 
ATTORNEY/AGENT INFORMATION: 

NAME: Osman, Richard A. 

REGISTRATION NUMBER: 36,627 

REFERENCE/ DOCKET NUMBER: B94-002-1 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 

TELEX: 

INFORMATION FOR SEQ ID NO: 58: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 730 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-391-58 



Query Match 17.8%; Score 969.5; DB 3; Length 730; 

Best Local Similarity 34.4%; Pred. No. 5.7e-83; 

Matches 244; Conservative 124; Mismatches 224; Indels 117; Gaps 29; 

Qy 8 LYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR HRLDIQMI 61 

I : I I I : I I III I M I : ::: 

Db 11 LLWVALH— AAAWVNDVSP KMYVQF GEERVQRFLGNESHKDHFKLL 54 

Qy 62 MIMNGTLYIAARDHIYTVDID--TSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHN 119 

: : I : M : : I : : I I I : : : I I I : I : I I I : I : I I 

Db 55 EKDHNSLLVGARNI VYNISLRDLTEFTEQ RIEWHSSGAHRELCYLKGKSEDDCQN 109 



Qy 120 FIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGD E F S GMARC P YDAKHAN V 174 



: I : I I I : I I : : I I I I I : I I I : I : II hi I I : I I : 

Db 110 YIRVLAKIDDDRVLICGTNAYKPLCRHYALKD GDYWEKEYEGRGLCPFDPDHNST 165 

Qy 175 ALFADGKLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYF 234 

I : :: : I : I I I I I I I I I : I I I I I I : I I I III : : I I : I : I 

Db 166 AIYSEGQLYSATVADFSGTDPLIYRG PLRTERSDLKQLNAPNFVNTMEYNDFIFF 220 

Qy 235 FFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFN 2 94 

I I I I I I I I I I : : I I I : I I I : I I I : :: I I I I I I : I I I I I I I I I I I I I 

Db 221 FFRETAVEYINCGKAIYSRVARVCKHDKGGPHQGGDR-WTSFLKSRLNCSVPGDYPFYFN 279 

Qy 295 I LQAVT DVI RIN- GRDV VLATFSTPYNSI PGSAVCAYDMLDIASVFTGRFKEQKSPD 350 

:|: :|:| II I : I : I I I M I I I I I I : I I I I 1111— : 
Db 280 EI QSTS DI I EGN YGGQVEKLI YGVFTT PVNS I GGS AVCAFSMKS I LES FDGP FKEQETMN 339 

Qy 351 STWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPW 410 

II II : I I : I I I I I I I I : : I I : I : I M I I I I I : I I I 

Db 340 SNWLAVPSLKVPEPRPGQCVND S RT LP D VS VN FVK S H T LMD EAVP AF FT RP I 391 

Qy 411 FLRTMVRYRLTKIAVD TAAGP YQNHT WFLGS EKGI I LKFLARI GNS GFLN 461 

: I :: I I I I I I I I I I : : | : | : | : : | : : I I II : 

Db 392 LIRISLQYRFTKIAVDQQVRTPDG — KAYDVLFIGTDDGKVIKAL NSAS FDSSDTV 445 

Qy 462 DSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRC — ERHG 519 

I I : : M : I II I : : : : I I I I : : : I I I : : 

Db 446 DSWIEELQVLP PGVPVKNLYWRMDGDDSKLVWSDDEILAIKLHRCGSDKIT 499 

Qy 520 KCKKTCIASRDPYCGWIKEGGACSHL-SPN SRLTFEQDIERGNTDGLGDCHNSFVAL 575 

I :: I :: : I I I I I I : : I I : : I hi I I : = 

Db 500 NCRE-CVSLQDPYCAWDNVELKCTAVGSPDWSAGKRRFIQNISLGEHKACGGRPQTEIV- 557 

Qy 57 6 NGHSSSLLPSTTTSDSTA QEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 62 8 

: | : | : I : I : I : I : I : :: II :: I I 

Db 558 AS PVPTQPTTKSSGDPVHSIHQAEFEPE IDNEIVIGVDDSNVI PNTLAEINHA 610 

Qy 629 DKKGVI RES YLKGHDQLVPV TL-LAI AVI LAFVMGAVFS 666 

I I : : h 11:11 | : : : | : | : | | 

Db 611 GSK LPSSQEKLPIYTAETLTIAIVTSCLGALWGFISGFLFS 652 



RESULT 13 
US-09-060-610-58 

Sequence 58, Application US/09060610 
Patent No. 6344544 
GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L. 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R. 
APPLICANT: O'Connor, Timothy 

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 100 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 Bush Street, Suite 3200 
CITY: San Francisco 
STATE: CA 



COUNTRY: USA 

ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.25 

; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/060 , 610 

FILING DATE: 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/835,268 

FILING DATE: 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Osman, Richard A. 

REGISTRATION NUMBER: 36,627 

REFERENCE/ DOCKET NUMBER: B94-002-1 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (415)343-4341 

TELEFAX: (415) 343-4342 

TELEX: 

; INFORMATION FOR SEQ ID NO: 58: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 730 amino acids 
; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 
US-09-060-610-58 

Query Match 17.8%; Score 969.5; DB 4; Length 730; 

Best Local Similarity 34.4%; Pred. No. 5. 7e-83; 

Matches 244; Conservative 124; Mismatches 224; Indels 117; Gaps 29; 

Qy 8 LYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR HRLDIQMI 61 

I : I I I : I I III I II I : : : : 

Db 11 LLWVALH — AAAWVNDVS P KMYVQF GEERVQRFLGNESHKDHFKLL 54 

Qy 62 MIMNGTLYIAARDHI YTVDID — TSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHN 119 



Db 55 EKDHNSLLVGARNI VYNI SLRDLTEFTEQ RIEWHSSGAHRELCYLKGKSEDDCQN 109 

Qy 120 FIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGD EFSGMARCPYDAKHANV 174 

: I : I I I : I I : : I I I I I : I I I : I : II hi I I : I I : 

Db 110 YI RVLAKI DDDRVLI CGTNAYKPLCRHYALKD GDYWEKEYEGRGLCPFDPDHNST 165 

Qy 175 ALFADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYF 234 

I :: : : I : I I I I I I I I I : I I I I I I : I I I III : : I I : I : I 

Db 166 AIYSEGQLYSATVADFSGTDPLIYRG PLRTERSDLKQLNAPNFVNTMEYNDFIFF 220 

Qy 235 FFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFN 294 

I I I I I I I I | | : : | M : | I I : I I I : : : I I I I I I : I I I I II I I I I I I I 
Db 221 FFRETAVEYINCGKAIYSRVARVCKHDKGGPHQGGDR-WTSFLKSRLNCSVPGDYPFYFN 279 

Qy 295 I LQAVT DVI RI N- GRDV VLATFSTPYNS I PGSAVCAYDMLDIASVFTGRFKEQKS PD 350 

: I : : I : I II I : I : II M I I I I I I I : I I | I I I I I : : : 
Db 280 EIQSTSDI I EGNYGGQVEKLI YGVFTTPVNS I GGSAVCAFSMKS I LES FDGPFKEQETMN 339 



Qy 351 STWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPW 410 

II II : I I : I I I I I I M : : I I : I : I I I I I I I I : I I I 

Db 340 SNWLAVPSLKVPEPRPGQCVND S RT L P D VS VN FVK S HT LMD EAVP AF FT R P I 391 

Qy 411 FLRTMVRYRLTKIAVD TAAGP YQNHTWFLGSEKGI I LKFLARI GNSGFLN 4 61 

: I :: I I I I I I I I I I : : I : I : I : : I : : I I II : 

Db 392 LI RI S LQ YRFTKI AVDQQVRT P DG — KAYDVLFIGTDDGKVIKAL NSASFDSSDTV 4 45 

Qy 462 DSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRC--ERHG 519 

I I : : I I : I II I : ::: I III :: : I I I :: 

Db 446 DSWT EELQVLP PGVPVKNLYWRMDGDDSKLVWSDDEILAIKLHRCGSDKIT 499 

Qy 52 0 KCKKTCIASRDPYCGWIKEGGACSHL-SPN SRLTFEQDI ERGNTDGLGDCHNS FVAL 575 

I :: I : : : I I I I I I : : I I : : I hi I I : : 

Db 500 NCRE-CVSLQDPYCAWDNVELKCTAVGSPDWSAGKRRFIQNISLGEHKACGGRPQTEIV- 557 

Qy 576 NGHSSSLLPSTTTSDSTA QEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 628 

: I : I : I : I : I : I ': I : : : II :: I I 

Db 558 AS PVPTQPTTKSSGDPVHSIHQAEFEPE 1 DNEI VI GVDDSNVI PNTLAEINHA 610 

Qy 62 9 DKKGVI RES YLKGHDQLVPV TL-LAI AVI LAFVMGAVFS 666 

I I : : I : I I : I I I :: : I : I : I I 

Db 611 GSK LPSSQEKLPIYTAETLTIAIVTSCLGALWGFISGFLFS 652 

RESULT 14 

PCT-US94-10151A-58 

Sequence 58, Application PC/TUS9410151A 
GENERAL INFORMATION : 

APPLICANT: The Regents of the University of California 
TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 66 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: FLEHR HOHBACH TEST ALBRITTON & HERBERT 
STREET: 4 Embarcadero Center, Suite 3400 
CITY: San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP: 94111-4187 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US94/10151A 
FILING DATE: 13-SEP-1994 
CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 
NAME: Osman, Richard A. 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: FP-58750-PC/RAO 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 781-1989 
TELEFAX: (415) 398-3249 
TELEX: 910 277299 FHT UR 



INFORMATION FOR SEQ ID NO: 58: 
SEQUENCE CHARACTERISTICS : 
LENGTH: 730 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
PCT-US94-10151A-5 8 

Query Match 17.8%; Score 969.5; DB 5; Length 730; 

Best Local Similarity 34.4%; Pred. No. 5.7e-83; 

Matches 244; Conservative 124; Mismatches 224; Indels 117; Gaps 29; 

Qy 8 LYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR HRLDIQMI 61 

I : II 1:11 III I II I : : : : 

Db 11 LLWVALH — AAAWVNDVSP KMYVQF GEERVQRFLGNESHKDHFKLL 54 

Qy 62 MIMNGTLYIAARDHIYTVDID — TSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHN 119 

:■ : I : I I : : I : : I I I : : : I I I : I : I I I : I : I I 

Db 55 EKDHNSLLVGARNIVYNISLRDLTEFTEQ RIEWHSSGAHRELCYLKGKSEDDCQN 109 

Qy 120 FIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGD EFSGMARCPYDAKHANV 174 

: I : I I I : I I : : I I I I I : I I I : I : II I : I I I : I I : 

Db 110 YI RVLAKI DDDRVLI CGTNAYKPLCRHYALKD GDYWEKEYEGRGLCPFDPDHNST 165 

Qy 175 ALFADGKLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYF 234 

I : : : : I : I I I I I I I I I : I I I I I I : I I I III : : I I : I : I 

Db 166 AIYSEGQLYSATVADFSGTDPLIYRG PLRTERSDLKQLNAPNFVNTMEYNDFIFF 220 

Qy 235 FFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFN 2 94 

I I I I I I I I I I : : I II : I I I : I I I : : : I I I I I I : I I I I I I I I I I I I I 

Db 221 FFRETAVEYINCGKAIYSRVARVCKHDKGGPHQGGDR-WTSFLKSRLNCSVPGDYPFYFN 279 

Qy 295 I LQ AVT DVI RI N - GRDV VLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPD 350 

: I : : I : I II I : I : I I I I I I I I I I I : I I I INN:: : 
Db 280 EIQSTSDIIEGNYGGQVEKLIYGVFTTPVNSIGGSAVCAFSMKSILESFDGPFKEQETMN 339 

Qy 351 STWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPW 410 

II II : I I : I I I I I I I I : : I I : I : I I I I I I I I : I I I 

Db 340 SNWLAVPSLKVPEPRPGQCVND S RT L P DVS VN FVKS HT LMD EAVP AFFT RP I 391 

Qy 411 FLRTMVRYRLTKIAVD TAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLN 4 61 

: I :: I I I I I I I I I I : : I : I : I : : I : : I I II : 

Db 392 LIRISLQYRFTKIAVDQQVRTPDG — KAYDVLFIGTDDGKVTKAL NSASFDSSDTV 445 

Qy 462 DSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRC — ERHG 519 

I I : : I I : I II I : : : : I III : : : I I I : : 

Db 446 DSWI EELQVLP PGVPVKNLYWRMDGDDSKLVWSDDEILAIKLHRCGSDKIT 499 

Qy 520 KCKKTCIASRDPYCGWI KEGGACSHL- S PN SRLTFEQDIERGNTDGLGDCHNSFVAL 575 

I : : I : : : I I I I I I : : I I : : I I : I I I : : 

Db 500 NCRE- CVS LQD P YCAWDNVELKCTAVGS PDWSAGKRRFI QN I S LGEHKACGGRPQTEI V- 557 

Qy 576 NGHSSSLLPSTTTSDSTA QEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 628 

: I : I : I : I : I : I : | : : : II : : I I 

Db 558 ASPVPTQPTTKSSGDPVHSIHQAEFEPE IDNEIVIGVDDSNVIPNTLAEINHA 610 



Qy 



62 9 DKKGVIRESYLKGHDQLVPV TL-LAI 



AVI LAFVMGAVFS 666 



Db 611 GSK LPSSQEKLPIYTAETLTIAIVTSCLGALWGFISGFLFS 652 



RESULT 15 
US-08-121-713D-60 

Sequence 60, Application US/08121713D 
Patent No. 5639856 
GENERAL INFORMATION: 

APPLICANT: Goodman, Corey S. 
APPLICANT: Kolodkin, Alex L. 
APPLICANT: Matthes, David 
APPLICANT: Bentley, David R. 
APPLICANT: O'Connor, Timothy 

TITLE OF INVENTION: The Semaphorin Gene Family 
NUMBER OF SEQUENCES: 100 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 Bush Street, Suite 3200 
CITY: San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE : Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/121, 713D 
FILING DATE: 13-SEP-1993 
CLASSIFICATION: 514 
ATTORNEY/AGENT INFORMATION: 
NAME: Osman, Richard A. 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: B94-002-1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
TELEX: 

INFORMATION FOR SEQ ID NO: 60: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 650 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-121-713D-60 

Query Match 16.1%; Score 880; DB 1; Length 650; 

Best Local Similarity 35.8%; Pred. No. 1.6e-74; 

Matches 207; Conservative 95; Mismatches 194; Indels 82; Gaps 19; 

Qy 114 KDECHNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMD TLEPFGDEFSGMARCPYDA 169 

: I : I I : I ::: : : I I I I I I I : I I I I : III : I I I I I I 

Db 1 EDDCQNYIRIMWPSPGRLFVCGTNSFRPMCNTYIISDSNYTLEA TKNGQAVCPYDP 57 

Qy 170 KHANVALFADGKLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYG 229 



• I • • • I I • I I I I I [ I I 'III I • I "II I III* I 

Db 58 RHNSTSVLADNELYSGTVADFSGSDPIIYRE P LQT EQ YDS L S LNAPN FVS S FTQG 112 

Qy 230 DYIYFFFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDS 289 

I :: I I I I I I I I I : M : : I I I : I I I I I I I : I I I I I I : I I I I I : I I I 

Db 113 DFVYFFFRETAVEFINCGKAIYSRVARVCKWDKGGPHR- FRNRWTSFLKSRLNCSIPGDY 171 

Qy 290 H FY FN I LQAVT DVI R INGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKE 345 

I I I I : I : : : : : : : I : I I I I I I I II I I I : : I I I I I : I I I 

Db 172 PFYFNEIQSASNLVEGQYGSMSSKLIYGVFNTPSNSIPGSAVCAFALQDIADTFEGQFKE 231 

Qy 346 QKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSI 405 

I : I I I I I I I I I I I : I I I I I I I I I I I I I I I I I : 

Db 232 QTGINSNWLPVNNAKVPDPRPGSC HNDSRALPDPTLNFIKTHSLMDENVPAF 283 

Qy 406 FNRPWFLRTMVRYRLTKIAVD TAAGPYQNHTWFLGSEKGI I LKFL-ARI GNSGFL 460 

I : : I : I I I I I : I I I I I I : : I : I : I : : I I : I : I : I 

Db 284 FS QPI LVRT S T I YRFTQ I AVDAQ I KT P GG — KT YDVI FVGTDHGKI I KSVNAESADSADK 341 

Qy 461 NDSLFLEEMSVYNSEKCSYDGVEDKRIM-GMQLDRASSSLY VAFSTCVIKVPLG 513 

I : : I I : I : : : | : | | | : I : : I : : I 

Db 342 VTSWIEEIDVLTKS EPIRNLEIVRTMQYDQPKDGSYDDGKLIIVTDSQWAIQLH 397 

Qy 514 RC — ERHGKCKKTCIASRDPYCGWIKEGGAC-SHLSPN — SRLTFEQDIERGNTDGLGDC 568 

II :: I : I : I : I I I I I I I I I I : I I I :: I 

Db 398 RCHNDKITSCSE-CVALQDPYCAWDKIAGKCRSHGAPRWLEENYFYQNVATGQ 44 9 

Qy 569 HNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 628 

I : : : I : I I II I I I I I 

Db 450 HAACPSGKINSKDANAGEQKGFRNDM DLLDS RRQ 483 

Qy 629 DKKGVIRESYLKGHDQLVPVTLLAIAVI LAFVMGAVFS 666 

I I : : I : : : : : : I : I : : I I 

Db 484 SKDQEIIDNIDKNFEDIINAQYTVETLVMAVLAGSIFS 521 



Search completed: March 24, 2004, 13:17:59 
Job time : 36.7132 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: March 24, 2004, 13:11:23 ; Search time 30.8439 Seconds 

(without alignments) 
3212.214 Million cell updates/sec 

Title: US-09-856-681A-2 
Perfect score: 5450 

1 MRSEALLLYFTLLHFAGAGF PPKPSFAPLSTSMKPNDACT 1030 



Sequence : 
Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283366 



Database 



PIR_78:* 
pirl : * 
pir2:* 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
JH0798 

fasciclin IV precursor - American bird grasshopper 

C; Species: Schistocerca americana (American bird grasshopper) 

C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 07-May-1999 

C; Accession: JH0798 

R;Kolodkin, A.; Matthes, D.J.; O'Connor, T.P.; Patel, N.H.; Admon, A.; Bentley, 
D. ; Goodman, C.S. 
Neuron 9, 831-845, 1992 

A; Title: Fasciclin IV: Sequence, expression, and function during growth cone 
guidance in the grasshopper embryo. 

A; Reference number: JH0798; MUID : 93040225 ; PMID: 1418998 
A; Accession: JH0798 
A; Molecule type: mRNA 
A; Residues: 1-730 <KOL> 

A; Cross-references: GB:L00709; NID:gl60844; PID:gl60845 
A; Experimental source: embryo 

C; Comment: This protein plays a role in growth cone guidance in the developing 
central nervous system. 

C; Keywords: glycoprotein; transmembrane protein 



F; 1-22/Domain : signal sequence #status predicted <SIG> 
F; 23-7 30/ Product : fasciclin IV #status predicted <MAT> 
F;23-627/Domain: extracellular #status predicted <EXT> 
F; 62 8 -65 2 /Domain : transmembrane #status predicted <TMM> 
F; 653-730/Domain: intracellular #status predicted <INT> 

F;44, 71, 163,267,360, 539/Binding site: carbohydrate (Asn) (covalent) #status 
predicted 

Query Match 17.8%; Score 971.5; DB 2; Length 730; 

Best Local Similarity 34.4%; Pred. No. 5.7e-57; 

Matches 244; Conservative 124; Mismatches 224; Indels 117; Gaps 



Qy 


8 


LYFTLLHFAGAGFPEDSEPIS I SHGNYTKQYPVFVGHKPGRNTTQR HRLDIQMI 

1 : 1 1 1 : 1 1 III 1 II 1 : : : = 
LLWVALH — AAAWVNDVSP KMYVQF GEERVQRFLGNESHKDHFKLL 


61 


Db 


11 


54 


Qy 


62 


MIMNGTLYIAARDHIYTVDID — TSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHN 
EKDHNSLLVGARNIVYNISLRDLTEFTEQ RIEWHSSGAHRELCYLKGKSEDDCQN 


119 


Db 


55 


109 


Qy 


120 


FIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGD EFSGMARCPYDAKHANV 

: 1 : 1 1 1 : 1 1 : : 1 1 1 1 1 : 1 1 1 : 1 : II 1 : 1 1 1 = 1 1 : 
YI RVLAKI DDDRVLI CGTNAYKPLCRHYALKD GDYWEKEYEGRGLCPFDPDHNST 


174 


Db 


110 


165 


Qy 


175 


ALFADGKLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYI YF 

1 : : : : 1 : 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 : 1 1 1 III : : 1 1 : 1 : 1 

AIYSEGQLYSATVADFSGTDPLIYRG PLRTERSDLKQLNAPNFVNTMEYNDFIFF 


234 


Db 


166 


220 


Qy 


235 


FFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFN 

I I I I MM II : : 1 1 1 M II : 1 II • ' • 1 M 1 1 1 M M 1 1 1 M 1 MM 

FFRETAVEYINCGKAIYSRVARVCKHDKGGPHQFGDR-WTSFLKSRLNCSVPGDYPFYFN 


294 


Db 


221 


279 


Qy 


295 


ILQAVTDVIRIN-GRDV VLAT FSTP YNS I PGSAVCAYDMLDIASVFTGRFKEQKS PD 

: I : : I : 1 II 1 : MM Ml 1 II II M 1 1- 1 1 Mil:: : 
EIQSTSDIIEGNYGGQVEKLIYGVFTTPVNSIGGSAVCAFSMKSILESFDGPFKEQETMN 


350 


Db 


280 


339 


Qy 


351 


STWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPW 

| | || MMMM 1 1 M ::IMMI 1111111= 1 1 1 
SNWLAVPSLKVPEPRPGQCVND S RT L P DVS VN FVKS HT LMDEAVPAFFT RP I 


410 


Db 


340 


391 


Qy 


411 


FLRTMVRYRLTKIAVD TAAGPYQNHTWFLGSEKGI I LKFLARI GNSGFLN 

: 1 : M 1 1 II 1 II 1 1 : : M M 1 : : 1 : M 1 II : 
LI RI SLQYRFTKIAVDQQVRT PDG — KAYDVLFI GTDDGKVI KAL NSASFDSSDTV 


461 


Db 


392 


445 


Qy 


4 62 


DSLFLEEMSVYNS EKCS YDGVEDKRIMGMQLDRAS S SLYVAFSTCVI KVPLGRC — ERHG 
||: MM 1 || | : ::M III :: : I || :: 
DSWIEELQVLP PGVPVKNLYWRMDGDDSKLVWSDDEILAIKLHRCGSDKIT 


519 


Db 


446 


499 


Qy 


520 


KCKKTCI ASRDPYCGWI KEGGACSHL- S PN SRLTFEQDIERGNTDGLGDCHNSFVAL 

1 : : M : Mill 1 1 : : 1 1 : : 1 1 : 1 1 1 : : 
NCRE-CVSLQDPYCAWDNVELKCTAVGSPDWSAGKRRFIQNISLGEHKACGGRPQTEIV- 


575 


Db 


500 


557 


Qy 


576 


NGHSSSLLPSTTTSDSTA--- QEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 

M :M 1: M 1 M :| : :: II :: II 
ASPVPTQPTTKSSGDPVHSIHQAEFEPE IDNEIVIGVDDSNVIPNTLAEINHA 


628 


Db 


558 


610 


Qy 


629 


DKKGVI RES YLKGHDQLVPV TL-LAI AVI LAFVMGAVFS 666 





611 GSK LPSSQEKLPIYTAETLTIAIVTSCLGALWGFISGFLFS 652 



RESULT 2 
B49423 

semaphorin I - fruit fly (Drosophila melanogaster ) (fragment) 
C; Species: Drosophila melanogaster 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text__change 07-May-1999 
C; Accession: B49423 

R;Kolodkin, A.L.; Matthes, D.J.; Goodman, C.S. 
Cell 75, 1389-1399, 1993 

A;Title: The Semaphorin genes encode a family of transmembrane and secreted 
growth cone guidance molecules . 

A;Reference number: A49423; MUID : 94094332 ; PMID: 8269517 
A; Accession: B49423 

A; Status: preliminary; nucleic acid sequence not shown; not compared with 

conceptual translation 

A;Molecule type: mRNA 

A; Residues: 1-656 <KOL> 

A;Cross-references: GB:L26082 

C;Genetics : 

A; Gene: semal 

A;Cross-references : FlyBase : FBgn0011259 

Query Match 16.1%; Score 875; DB 2; Length 656; 

Best Local Similarity 35.6%; Pred. No. 1.4e-50; 

Matches 208; Conservative 95; Mismatches 193; Indels 88; Gaps 20; 



Qy 


114 


KDECHNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMD TLEPFGDEFSGMARCPYDA 

: | : I I : I : : : : : 1 1 1 1 1 1 1 : 1 1 1 1 : III : 1 1 1 1 1 1 

EDDCQNYIRIMWPSPGRLFVCGTNSFRPMCNTYIISDSNYTLEA TKNGQAVCPYDP 


169 


Db 


1 


57 


Qy 


170 


KHANVALFADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYG 

: I : : : 1 1 : 1 M 1 1 1 1 1 : 1 1 1 1 : 1 : : 1 1 1 III: 1 
RHNSTSVLADNELYSGTVADFSGSDPII YRE PLQTEQ YDSLSLNAPNFVS S FTQG 


229 


Db 


58 


112 


Qy 


230 


DYIYFFFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDS 

I :: I I I I I I I 1 1 : 1 1 : : 1 1 I : 1 1 1 1 1 1 1 : 1 1 1 1 1 1 : 1 1 1 1 1 : 1 M 

DFVYFFFRETAVEFINCGKAIYSRVARVCKWDKGGPHR- FRNRWTSFLKSRLNCSIPGDY 


289 


Db 


113 


171 


Qy 


290 


H FYFN I LQAVT DVI R INGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKE 

| | | | : I : : : : : : : 1 : 1 1 1 1 1 1 1 1 1 1 1 1 : : 1 II 1 1 : 1 1 1 

PFYFNEIQSASNLVEGQYGSMSSKLIYGVFNTPSNSIPGSAVCAFALQDIADTFEGQFKE 


345 


Db 


172 


231 


Qy 


346 


QKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSI 
I :| 1 II : :M Mil 1 : 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 : 
QTGINSNWLPVNNAKVPDPRPGSC HNDSRALPDPTLNFIKTHSLMDENVPAF 


405 


Db 


232 


283 


Qy 


406 


FNRPWFLRTMVRYRLTKIAVD TAAGPYQNHTWFLGSEKGIILKFL-ARIGNSGFL 


460 


Db 


284 


|::| :|| 111:1111 1 1 : : 1 : 1 : 1 : : 1 1 : 1 : 1 :l 
FSQPILVRTSTIYRFTQIAVDAQIKTPGG — KTYDVI FVGTDHGKI IKSVNAESADSADK 


341 


Qy 


461 


NDS LFLEEMSVYNS EKC S YDGVEDKRIM- GMQLDRAS S S LY VAFSTCVIKVPLG 

| : : | | : | : : : 1 : 1 1 1 : 1 : : 1 : : 1 
VTSWIEEIDVLTKS EPIRNLEIVRTMQYDQPKDGSYDDGKLII VTDSQWAIQLH 


513 


Db 


342 


397 



Qy 514 RC — ERHGKCKKTCIASRDPYCGWIKEGGAC-SHLSPN — SRLTFEQDIERGNTDGLGDC 568 

II : : I : I : I : I I I I I I I I I I : I I I - I 

Db 398 RCHNDKITSCSE-CVALQDPYCAWDKIAGKCRSHGAPRWLEENYFYQNVATGQ 449 

Qy 569 HNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 628 

I : : : I : I I I I I I I II I 
Db 450 HAACPSGKINSKDANAGEGKGFRNDM DLLDS RRQ 4 83 

Qy 629 DKKGVIRESYLKGHD QLVPVTLLAIAVILAFVMGAVFS 666 

| | :: | : :: : : : I : I : : I I 

Db 484 SKDQEI I DNI DKNFEGPQTSADI INAQYTVETLVMAVLAGS I FS 527 



RESULT 3 
D49423 

semaphorin III precursor - human t 
C; Species: Homo sapiens (man) 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text_change 24-Sep-1999 
C; Accession: D49423 

R;Kolodkin, A.L.; Matthes, D.J.; Goodman, C.S. 
Cell 75, 1389-1399, 1993 

A;Title: The Semaphorin genes encode a family of transmembrane and secreted 
growth cone guidance molecules . 

A; Reference number: A49423; MUID : 94094332 ; PMID: 8269517 
A;Accession: D49423 

A; Status: preliminary; nucleic acid sequence not shown 
A;Molecule type: mRNA 
A; Residues: 1-771 <KOL> 

A;Cross-references: GB:L26081; NID:g799328; PIDN: AAA65938 . 1; PID:g436560 

C; Genetics : 

A; Gene: GDB : SEMAl 

A; Cross-references : GDB: 283448 

C; Super family : semaphorin 

Query Match 15.6%; Score 8 52.5; DB 2; Length 771; 

Best Local Similarity 32.1%; Pred. No. 5.8e-49; 

Matches 208; Conservative 106; Mismatches 250; Indels 83; Gaps 20; 

PGRNTTQRHRLDIQMIMIMNGT LYIAARDHI YTVD 8 0 

|:| | : | : : : I I I : I : I I I : : I 



I : I II: II II I I I I I I I hill 



Qy 


44 


Db 


22 


Qy 


81 


Db 


82 


Qy 


140 


Db 


137 


Qy 


193 


Db 


197 


Qy 


246 


Db 


257 



| : | | :: I : : : I : I I I I : I I I : I I I I II 



I : I : I I : I I : I II : I I : I I : I I : I I I I I I I 



[GKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPG DSHFYFNILQAVTD 301 

II I : I : I I I I I I II : I I : I I I I I I I I II I I : I I : I 

IGKATHARIGOICKNDFGG-HRSLWKWTTFLKARLICSVPGPNGIDTHF DELQD 310 



Qy 302 VIRINGRD WLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVP 357 

I : I : I II I : I I I I I I I I I I : III: ' I : I I 

Db 311 VFLMNFKDPKNPWYGVFTTSSNIFKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPY- 369 



Qy 358 DERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 

I I I I I I I I I : : : : : | | | : I : : I I I I : I I I : : I I 

Db 370 QGRVPYPRPGTCP-SKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVI KTDVN 428 



Qy 418 YRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKC 477 

I : I : I I I : | : | : | : : | : | | : : : : : : I I I I : I : 
Db 42 9 YQFTQI VVDRVDAEDGQYDVMFI GTDVGTVLKWS I PKETWYDLEEVLLEEMTVFR 484 



Qy 47 8 SYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIK 537 

I I hi II: : I : : I I I I : : I I I : I I I I I I 

Db 485 EPTAISAMELSTKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAW — 537 

Qy 538 EGGACSHLSPNS-RLTFEQDIERGNTDGLGDC HNSFVALNGHS — SSLLPSTTTSD 590 

: I I I I I : I I I I I I I I I I : : : I I I : : I 

Db 538 DGSACSRYFPTAKRRTRRQDIRNG--DPLTHCSDLHHDNH HGHSPEERIIYGVENSS 592 



Qy 591 STAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RES 637 

: : : | : : : | : : : : : | | : : I : : I 

Db 593 TFLECS PKSQRALVYWQFQRRNEERKEEI-RVDDHI IRTDQGLLLRS 638 



RESULT 4 
A49423 

semaphorin I precursor - beetle (Tribolium confusum) 
C; Species: Tribolium confusum 

C;Date: 19-Mar-1997 #sequence_revision 19-Mar-1997 #text_change 07-May-1999 
C;Accession: A49423 

R;Kolodkin, A.L.; Matthes, D.J.; Goodman, C.S. 
Cell 75, 1389-1399, 1993 

A; Title: The Semaphorin genes encode a family of transmembrane and secreted 
growth cone guidance molecules. 

A; Reference number: A49423; MUID : 94094332 ; PMID:8269517 
A; Accession: A49423 

A; Status: preliminary; nucleic acid sequence not shown; not compared with 

conceptual translation 

A;Molecule type: mRNA 

A; Residues: 1-711 <KOL> 

A; Cross-references : GB: L26080 

Query Match 15.5%; Score 845.5; DB 2; Length 711; 

Best Local Similarity 28.9%; Pred. No. 1.5e-48; 

Matches 230; Conservative 130; Mismatches 270; Indels 165; Gaps 30; 

Qy 12 LLHFAGAGFPEDSEPI SISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMN- 65 

I : I I : I : I : : : I II I : : : I 

Db 12 LIALCHAWMPDSSSKLINHFKSVESKSFT GNATFPDH FIVLNQ 54 

Qy 66 — GTLYIAARDHIYTVDI-DTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIK 122 

:: : I : : I : I I I : :: I I I I : I I I 1 = 1 hh 

Db 55 DETS I LVGGRNRVYNLS I FDLS ERK GGRIDWPSSDAHGQLCILKGKTDDDCQNYIR 110 

Qy 123 VLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGD-EFSGMARCPYDAKHANVALFADGK 181 



: I I : I I I I :: I I I I : -: I I : I I I : : I : : : : I : 

Db 111 ILYSSEPGKLVICGTNSYKPLCRTYAFKEGKYLVEKEVEGIGLCPYNPEHNSTSVSYNGQ 170 

Qy 182 LYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAV 241 

I : I I I I I I I : I I I II III I I I : I I I I I I : I I : I I I I 

Db 171 LFSATVADFSGGDPLIYRE PQRTELSDLKQLNAPNFVNSVAYGDYIFFFYRETAV 225 

Qy 242 EYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTD 301 

II I I I :: I I I : I I I : I I I : : : I I I I I I I I I I I I : I I : I I I : : I : : I 

Db 226 EYMNCGKVIYSRVARVCKDDKGGPHQSRDR-WTSFLKARLNCSIPGEYPFYFDEIQSTSD 284 

■> 

Qy 302 VI — RINGRD — WLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVP 357 

: : II I : : : I I I : I I I I : I I I I I I I I I M I : : : I I I I I 

Db 285 IVEGRYNSDDSKIIYGILTTPVNAIGGSAICAYQMADILRVFEGSFKHQETINSNWLPVP 344 

Qy 358 DERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 

11:11111 I : II : | | | | | | | | : : I I : : I : I : I : : 

Db 345 QNLVPEPRPGQCVRDSRI L P D KN VN F I KT H S LME D - VP AL FG K P VL VRVS LQ 395 

Qy 418 YRLTKIAVDTAAGPYQNH — TWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSE 475 

I I I I I I I | : : : | : : | : M 
Db 396 YRFTAITVDPQVKTINNQYLDVLYIGTDDGKVLK 42 9 

Qy 476 KCSYDGVEDKRIMGMQLDRASSSLYVAFSTCV IKVP 511 

: : I : I I : I I : I : 

Db 430 AVNIPKRHAKALLYRKYRTSVHPHGAPVKQLKIAPGYGKWWGKDEIR 478 

Qy 512 LGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTF-EQDIERGN 561 

I I : I I I : : I I : I I : I : : I I I : I I : 

Db 479 LANLNHCASKTRC-KDCVELQDPHCAWDAKQNLCVSIDTVTSYRFLIQDWRGDDNKCWS 537 

Qy 562 — TDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPL 619 

II I II :: I :| I I I I = I i I 

Db 538 PQTDKKTVIKNK PSEVENEIT NSIDEKDL DSSDPL 572 

Qy 620 GAVSSHNHQDKKGVI RESYLKG HDQLVPVT — L LAI AVI LAFVMGA VFSGITVY 671 

: I I I : : I III I III:: : : I : : | | : : | : 

Db 573 I KTGLDDDS DCDPV- S ENS I GGCAVRQQLVI YTAGTLHI VWWS I VGLFSWL YS GLS VF 631 

Qy 672 CV — CDHRRKDVAWQRKE — KELTHSRRGSMS-SVTKLSGLFGDTQSKDPKPEAILTPL 726 

I : : : : : : I : : : I : : I I I I : I : : 

Db 632 AKFHSDSQYPEAPFIEQHNHLERLSANQTGYLTPRANKAVNLWKVSSSTPRPKKDNLDV 691 

Qy 727 MHNGKLATPGNTAKM 741 

: : I : I I : 
Db 692 SKDLNIASDGTLQKI 7 06 



RESULT 5 
148747 

semaphorin D - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 24-Sep-1999 
C; Accession: 148747 

R;Puschel, A.W.; Adams, R.H.; Betz, H . 
Neuron 14, 941-948, 1995 



A; Title: Murine semaphorin D/collapsin is a member of a diverse gene family and 

creates domains inhibitory for axonal extension. 

A; Reference number: 148744; MUID: 95267431; PMID: 7748561 

A;Accession: 148747 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-772 <RES> 

A; Cross-references: EMBL:X85993; NID:g854329; PIDN : CAA59985 . 1 ; PID:g854330 
C; Genetics : 
A; Gene: semD 

C; Superf amily : semaphorin 

Query Match 15.5%; Score 842.5; DB 2; Length 772; 

Best Local Similarity 34.3%; Pred. No. 2.7e-48; 

Matches 200; Conservative 85; Mismatches 225; Indels 73; Gaps 16; 

Qy 47 GRNTTQRHRLDIQMIMIMNGT LYIAARDHI YTVDIDT 83 

I : I I : I : : : I I I : I : I II : : : : 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFNL — 82 

Qy 84 SHTEEIYCSKKLTWKSRQADVDTCRMKGKH-KDECHNFIKVLLKKNDDALFVCGTNAFNP 142 

I : I : I II: II I I I I I I I I I I : I I I I I : I 

Db 83 VNIKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLEAYNQTHLYACGTGAFHP 139 

Qy 143 SCR NYKMDTLEPFGDEF — SGMARCPYDAKHANVALFADGKLYSAT VTDFLAIDA 195 

I : : I : I : I : I I I I : I I I : I I I I I I : I 

Db 140 ICTYIEVGHHPEDNIFKLQDSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 

Qy 196 VIYRSLGESPTLRTVKHDSKWLKEPYFVQA VDYGDYI YFFFREIAVEYNTMGK 248 

I : I : I I : : I I : I I I : I I : I I : I I : I I I I I I I : I I 

Db 200 AIFRTLGDHHPIRTEQHDSRWLNDPRFISAHLIPESDNPEDDKVYFFFRENAIGGEHSGK 259 

Qy 249 WFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPG DSHFYFNILQAVTDVIR 304 

I : I : I I I I I I II : M : I I I I I I II I I I I : I I : I I 

Db 2 60 ATHARIGQICKNDFGG-HRSLVNKWTTFLKARLICSVPGPNGIDTHF DELQDVFL 313 

Qy 305 INGRD WLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

: I : I : I I : I I I I I I I I I I : III: : I : II I 

Db 314 MNS KDPKN P I VYGVFTT S SNI FKGSAVCMYSMS DVRRVFLGP YAHRDGPN YQWVP Y-QGR 372 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

II II I I I I : : : : : | | | : | :: I I I I I I I I :: I I I : 

Db 373 VPYPRPGTCP-SKTFGGFDSTKDLPDDVITFGRSHPAMYNPVFPINNRPIMIKTDVNYQF 431 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 4 80 

I : I I I : | : | : | : : | : | | : : : : : I I I I : I : 
Db 4 32 TQIWDRVDAEDGQYDVMFIGTDVGTVLKWSVPKETWHDLEEVLLEEMTVFR 4 84 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I 1:1 II: : I : : I I I I : : I I I : I I I I I I : I 

Db 485 — EPTTISAMELSTKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAW — DGS 540 

Qy 541 ACSHLSPNS-RLTFEQDIERGNTDGLGDCHNSFVALNGHSSSL 582 

: I I I : I I I I I I I I I : I I I I 

Db 541 SCSRYFPTAKRRTRRQDIRNG — DPLTHCSDLEDHDNHHGPSL 581 



RESULT 6 
A49069 

collapsin - chicken 

C; Species: Gallus gallus (chicken) 

C;Date: 07-Apr-1994 #sequence_revision 07-Apr-1994 #text_change 24-Sep-1999 
C; Accession: A4 9069 

R;Luo, Y. ; Raible, D.; Raper, J. A. 
Cell 75, 217-227, 1993 

A;Title: Collapsin: a protein in brain that induces the collapse and paralysis 
of neuronal growth cones. 

A; Reference number: A49069; MUID : 94006554 ; PMID: 8402908 
A; Accession: A4 9069 

A; Status: preliminary; not compared with conceptual translation 
A; Molecule type: mRNA 
A; Residues: 1-772 <LU0> 

A; Cross-references: GB:U02528; NID:g410078; PIDN : AAC59638 . 1 ; PID:g410079 
C; Super family : semaphorin 

Query Match 15.4%; Score 839.5; DB 2; Length 772; 

Best Local Similarity 35.2%; Pred. No. 4.3e-48; 

Matches 198; Conservative 87; Mismatches 220; Indels 57; Gaps 17; 

Qy 68 LYITW^DHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKH-KDECHNFIKVLLK 126 

I I : I : I I I : : : : I : I : I : I I : II I I I I I I I I 

Db 69 LYVGAKDHIFSFNL VNIKEYQKIVWPVSHSRRDECKWAGKDILRECANFIKVLKT 123 

Qy 127 KNDDALFVCGTNAFNPSCR NYKMDTLEPFGDEF — S GMARC P Y DAKHAN VAL FAD 179 

I I : I I I I I : I I : : I : I : I : I I I I : I I 

Db 124 YNQTHLYACGTGAFHPMCTYIEVGSHPEDNIFRMEDSHFENGRGKSPYDPKLLTASLLVD 183 

Qy 180 GKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQA VDYGDYI 232 

I : I I I I II: I I : I : I I : I I : I I I : I I : I I : I II 

Db 184 GELYSGTAADFMGRDFAI FRTLGHHHPIRTEQHDSRWLNDPRFI SAHLI PESDNPEDDKI 243 

Qy 233 YFFFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPG D 288 

I I I I I I I : : II I : I : I I I I I I II : I I : I I I I I I I I I I I I 

Db 244 YFFFRENAI DGEHTGKATHARI GQI CKNDFGG-HRSLVNKWTTFLKARLI CSVPGPNGI D 302 

Qy 289 SHFYFNILQAVTDVIRINGRD WLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFK 344 

: II : I I : I : I : I hi I I I I I I I I I : III: 

Db 303 THF DELQDVFLMNSKDPKNPIVYGVFTTSSNIFKGSAVCMYSMTDVRRVFLGPYA 357 

Qy 345 EQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPS 4 04 

: |: II III MM I I : : : : : I I : : I : : I I I I 

Db 358 HRDGPNYQWVPY-QGRVPYPRPGTCP-SKTFGGFDSTKDLPDEVITFARSHPAMYNPVFP 415 

Qy 4 05 IFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSL 4 64 

1:11 : : I I I : I : I I I : I : I : I : : I : II : : : : : 

Db 416 INSRPIMIKTDVDYQFTQIVVTJRVDAEDGQYDVMFIGTDIGTVTjKWSIPKETWHELEEV 475 

Qy 4 65 FLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKT 524 

1111 = 1= I II:: II: : I I — II I I : Ml 

Db 476 LLEEMTVFR EPTVI SAMKI STKQQQLYI GSATGVSQLPLHRCDVYGKACAE 526 

Qy 525 CIASRDPYCGWIKEGGACSHLSPNS-RLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLL 583 

I M I I I I I : I Ml I : I I III I II I : I I : 
Db 527 CCLARDPYCAW — DGSSCSRYFPTAKRRTRRQDIRNG — DPLTHCSD LQHHDN 575 



Qy 584 PSTTTSDSTAQEGYESRGGMLD 605 

III: II: I : 

Db 576 PSGQTLEEKIIYGVENSSTFLE 597 



RESULT 7 
JC5928 

semaphorin F precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 10-Apr-1998 #sequence_revision 08-May-1998 #text_change 17-Nov-2000 
C; Accession: JC5928 

R;Simmons, A.D.; Pueschel, A.W. ; McPherson, J.D.; Overhauser, J.; Lovett, M. 
Biochem. Biophys . Res. Commun. 242, 685-691, 1998 

A; Title: Molecular cloning and mapping of human semaphorin F from the Cri-du- 
chat candidate interval. 

A; Reference number: JC5928; MUID: 98125554 ; PMID: 9464278 
A; Accession: JC592 8 

A; Status: nucleic acid sequence not shown 
A;Molecule type: mRNA 
A; Residues: 1-1074 <SIM> 

A;Cross-references: GB:U52840; NID : g2772583 ; PIDN: AAC09473 . 1; PID:g2772584 
A; Experimental source: brain 

C; Comment: This protein disrupts normal brain development and leads to some of 
the features of Cri-du-chat. 
C; Genetics: 
A; Gene: semaf 

C; Superf amily : human semaphorin F; thrombospondin type 1 repeat homology 

F; 1-20/Domain: signal sequence ((status predicted <SIG> 

F; 50-533/Domain: semaphorin #status predicted <SEM> 

F; 8 4 0-8 9 6/ Domain : thrombospondin type 1 repeat homology <THR3> 

F; 971-993/Domain : transmembrane ((status predicted <TMM> 

Query Match 15.2%; Score 830; DB 2; Length 1074; 

Best Local Similarity 36.2%; Pred. No. 3.1e-47; 

Matches 199; Conservative 90; Mismatches 211; Indels 50; Gaps 17; 

Qy 68 LYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKK 127 

I : I I : : : : : : I : : : : I : : I I I I I I : I I I : I : I I I 

Db 70 LWGARNYLFRLQL EDLSLIQAVEWECDEATKKACYSKGKSKEECQNYIRVLL-V 123 

Qy 128 NDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALF-ADGKLYSAT 18 6 

I I I I I I I I I I I I : I I : I I I I I I I I : I : I I I I : I I : I I 

Db 124 GGDRLFTCGTNAFTPVCTNRSLSNLAEIHDQISGMARCPYSPQHNSTALLTAGGELYAAT 183 

Qy 187 VTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTM 246 

II I I I I I I I I I I I : :: I I I I I I I I : I I : : I I I I I I I I I :: 

Db 184 AMDFPGRDPAIYRSLGILPPLRTAQYNSKWLNEPNFVSSYDIGNFTYFFFRENAVEHD-C 242 

Qy 247 GKWF P RVAQ VC KN DMG G S Q RVL E KQWT S FL KARLN C S VP G D S H F Y FN I LQAVT D VI RI N 306 

I I I I I I : I I I I I : I I : : I I I I : I : M I I I I I I I : II : I I I : : : 
Db 243 GKTVFSRAARVCKNDIGG-RFLLEDTWTTFMKARLNCSRPGEVPFYYNELQSTFFLPEL- 300 

Qy 307 GRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRP 366 

I : : I : I III MM::: II | : | | | | : : I I I I : I | 
Db 301 — DLI YGI FTTNVNS I AASAVCVFNLS AI AQAFS GP FKYQEN S RSAWLP YPN PNPHF 355 



Qy 367 GCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVD 426 

I I I I III II: II: I : : : I I I 

Db 356 QCGTVDQGLYVNLTERNL-QDAQKFI LVH EWQPVTTVPS FMEDNSRF- - SHVAVD 4 08 

Qy 427 TAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKR 486 

I :::| :: I I I : : : I III: :: : : 
Db 409 WQGREALVHI I YLATDYGTI KKVRVPLNQT SSSCLLEEIELFPERR REP 458 

Qy 487 IMGMQLDRAS S SLYVAFSTCVT KVPLGRCERHGKCKKTCI ASRDPYCGWI KEGGACSHLS 546 

I :h : I hi h h I I II : : : : I I I : : I I I I I I hi 

Db 459 IRSLQILHSQSVLFVGLREHWKIPLKRCQFY-RTRSTCIGAQDPYCGWDWMKKCTSLE 517 

QY 547 PNSRLT-FEQDIERGNTDGLGDCHNSFVALNGHSSSLLPST — TTSDSTAQEGYESRGGM 603 

: : I : I I I I : : : II I I I : I : I I 
Db 518 ESLSMTQWEQSISA CPTRNLTVDGHFGVWS PWTPCTHTDGSAV GSC 563 

Qy 604 LDWKHLLDSP 613 

I III 
Db 564 LCRTRSCDSP 573 



RESULT 8 
G01856 

semaphorin V - human 

C; Species: Homo sapiens (man) 

C;Date: 21-Dec-1996 #sequence_revision 06-Jun-1997 #text_change 24-Sep-1999 
C; Access ion: G01856 
R;Sekido, Y. 

submitted to the EMBL Data Library, June 1995 
A; Reference number: G08634 
A; Accession: G01856 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-749 <SEK> 

A;Cross-references: EMBL:U28369; NID:g974283; PIDN : AAD09138 . 1 ; PID:g974284 
C; Super family : semaphorin 

Query Match 15.2%; Score 826; DB 2; Length 749; 

Best Local Similarity 34.7%; Pred. No. 3.3e-47; 

Matches 198; Conservative 88; Mismatches 228; Indels 56; Gaps 16; 

Qy 66 GTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKH-KDECHNFIKVL 124 

I I : : I : I : : : : : I : I I I I : : I I I I I I I : |: I 

Db 65 G RL FVGAENHVAS LN L DN I S KR AKKLAWPAPVEWREECNWAGKDIGTECMNFVKLL 120 

Qy 125 LKKNDDALFVCGTNAFNPSCRNYKMD — TLEPFGDEFSGMARCPYDAKHANVAL 176 

Ihhl :: hi I I : III :| :: 

Db 121 HAYNRTHLLACGTGAFHPTCAFVEVGHRAEEPVLRLDP-GRIEDGKGKSPYDPRHRAASV 179 

Qy 177 FADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAV DYG 22 9 

: I I I I : I h I I I I : h I I I I I I : I I I I I h 

Db 180 LVGEELYSGVAADLMGRDFTIFRSLGQRPSLRTEPHDSRWLNEPKFVKVFWIPESENPDD 239 

Qy 230 DYIYFFFREIAVE-YNTMGfCWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVP — 286 

I I I I I I I I I I I : h : I I I : I : I I : I I i I I : I I : I I II I I I I I I 

Db 24 0 DKIYFFFRETAVEAAPALGRLSVSRVGQICRNDVGG-QRSLVNKWTTFLKARLVCSVPGV 298 



Qy 287 - GD S H F YFN I LQAVT DVI RI NGRD WLATFSTPYNSIPGSAVCAYDMLDIASVFTG 341 

M : M : | | : : | | :: I II : I I I I I I I I : II 

Db 299 EGDTHF DQLQDVFLLS SRDHRT PLLYAVFSTSS S I FQGSAVCVYSMNDVRRAFLG 353 

Qy 342 RFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEA 401 

I :: I I III I I I I I I : : : : : : I II I : I : I I I I : 

Db 354 PFAHKEGPMHQWSY-QGRVPYPRPGMCP-SKTFGTFSSTKDFPDDVIQFARNHPLMYNS 411 

Qy 402 VPS I FNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT WFLGSEKGI I LKFLARI GNSGFLN 4 61 

I MM: | | : | | | | : : | : | : | : : | : | | : : | 

Db 412 VLPTGGRPLFLQVGANYTFTQIAADRVAAADGHYDVLFIGTDVGTVLKVI SVPKGSRPSA 471 

Qy 462 DSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKC 521 

= I I M : I : : : || : MM : I : : I II M : 

Db 472 EGLLLEELHVFE DSAAVTSMQISSKRHQLYVASRSAVAQIALHRCAAHGRV 522 

Qy 522 KKTCIASRDPYCGWIKEGGACSHLSPNSRLTF-EQDIERGNTDGLGDCHNSFVALNGH — 578 

I MIMI I : I II: |::: | ||: |: | :| || | 

Db 523 CTECCLARDPYCAW — DGVACTRFQPSAKRRFRRQDVRNGDPSTLCSGDSSRPALLEHKV 580 

Qy 579 SSSLLPSTTTSDSTAQEGYESRGGM 603 

M : I I I M: 

Db 581 FGVEGS S AF LE C E P RS LQARVEWT FQ RAGV 610 



RESULT 9 
148744 

semaphorin A - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 24-Sep-1999 
C; Accession: 148744 

R;Puschel, A.W.; Adams, R.H.; Betz, H. 
Neuron 14, 941-948, 1995 

A; Title: Murine semaphorin D/collapsin is a member of a diverse gene family and 
creates domains inhibitory for axonal extension. 
A;Reference number: 148744; MUID: 95267431; PMID:7748561 
A;Accession: 148744 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-748 <RES> 

A; Cross-references : EMBL:X85990; NID:g854323; PIDN : CAA59982 . 1 ; PID:g854324 
C; Genetics : 
A; Gene: semA 

C; Superfamily : semaphorin 

Query Match 14.7%; Score 803; DB 2; Length 74 8; 

Best Local Similarity 35.5%; Pred. No. l.le-45; 

Matches 191; Conservative 80; Mismatches 205; Indels 62; Gaps 18; 

Qy 66 GTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKH-KDECHNFIKVL 124 

I I : : I : I : : : : I Mill: : I II I I I I : : : I 

Db 65 G RL FVGAENH VAS LSLDNISKR AKKLAWPAPVEWREECNWAGKDIGTECMNFVRLL 120 

Qy 125 LKKN DDAL FVCGTNAFN P S C RN YKMDT L EPFGDEFSGMARCPYDAKHANVA 175 

I I I I M : I : I : : I I I I : III :| : 
Db 121 HAYNHT H LLACRT GAFH P T C ALW RWAT AGGT HAS TGPEKLED GKGKTPYDPRHRPPS 177 



Qy 


176 


LFADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAV DY 

: : 1 M 1 : 1 1 : 1 1 1 1 : : 1 : 1 1 1 1 1 1 : II 1 1 1 1 : 
VLVGEELYSGVTADLMGRDFTIFRSLGQNPSLRTEPHDSRWLNEPKFVKVFWIPESENPD 


228 


Db 


178 


237 


Qy 


229 


GDYIYFFFREIAVE-YNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVP- 

1 1 1 1 1 1 M 1 1 1 II:: 1 1 1 : 1 : 1 1 : 1 1 1 1 1 : 1 1 : 1 1 1 1 1 1 1 1 1 1 
DDKIYFFFRESAVEAAPAMGRMSVSRVGQICRNDLGG-QRSLVNKWTTFLKARLVCSVPG 


286 


Db 


238 


296 


Qy 


287 


— GDSHFYFNILQAVTDVIRINGRD WLATFSTPYNSI PGSAVCAYDMLDIASVFT 

1 1 : 1 1 2 1 1 : : 1 1 : : 1 1 1 1 1 1 1 1 1 1 1 1 : 1 
VEGDTHF DQLQDVFLLSSRDRQTPLLYAVFSTSSGVFQGSAVCVYSMNDVRRAFL 


340 


Db 


297 


351 


Qy 


341 


GRFKEQKS PDSTWTPVPDERVPKPRPGCCAGS S SLERYATSNEFPDDTLNFI KTHPLMDE 
1 : : 1 1 1 1 1 1 1 II 1 1 : : : : : : 1 1 1 1 : 1 : 1 II 1 
GPLPHKEGPTHQWVSY-QGRVPYPRPGMCP-SKTFGTFSSTKDFPDDVIQFGRNHPLMYN 


400 


Db 


352 


409 


Qy 


401 


AVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLA RIGN 

1 : II II: 1 hill 1 : : | : | : |: : | : | | : : | | 
PVLPMGGRPLFLQVGAGYTFTQIAADRVAAADGHYDVLFIGTDVGTVLKVISVPKGRRPN 


456 


Db 


410 


469 


Qy 


457 


SGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCE 
1 : 1 II 1 : 1 : : III: 1 1 1 1 1 : : 1 1 1 
S EGLLLEELQVFE DSAAI T SMQI S S KRQQLYVAS RAAVAQI ALHRCT 


516 


Db 


470 


516 


Qy 


517 


RHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTF-EQDIERGNTDGL — GDCHNS 571 

h 1 :M Ml 1 :| ||: | :: | ||| |: | || :| 
ALGRACAECCLARDPYCAW — DGSACTRFQPTAKRRFRRQDIRNGDPSTLCSGDSSHS 572 


Db 


517 



RESULT 10 
158169 

semaphorin III - mouse (fragment) 

C; Species: Mus musculus (house mouse) 

C;Date: 26~Jul-1996 #sequence_revision 26-Jul-1996 #text_change 24-Sep-1999 
C;Accession: 158169 

R;Messersmith, E.K.; Leonardo, E.D.; Shatz, C.J.; Tessier-Lavigne, M. ; Goodman, 
C.S.; Kolodkin, A. L . ■ 
Neuron 14, 949-959, 1995 

A; Title: Semaphorin III can function as a selective chemorepellent to pattern 
sensory projections in the spinal cord. 

A; Reference number: 158169; MUID: 95267432 ; PMID:7748562 
A; Accession: 158169 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-666 <RES> 

A;Cross-references: GB:L40484; NID:g703189; PIDN : AAA73934 . 1 ; PID:g703190 

C; Genetics : 

A; Gene: Semalll 

C; Superf amily : semaphorin 

Query Match 14.6%; Score 793.5; DB 2; Length 666; 

Best Local Similarity 34.1%; Pred. No. 4.1e-45; 

Matches 188; Conservative 89; Mismatches 226; Indels 49; Gaps 15; 

Qy 116 ECHNFIKVLLKKNDDALFVCGTNAFNPSCR NYKMDTLEPFGDEF — SGMARCPYD 168 

I I I II I I I I I : I II I h I I : : I : I : I : I I I 

Db 7 ECANFIKVLEAYNQTHLYACGTGAFHPICTYIEVGHHPEDNIFKLQDSHFENGRGKSPYD 66 



Qy 


169 


AKHANVALFADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQA 

1 ; l 11:111 1 :h 1 |:|:|| : 1 1 : 1 1 1 : 1 1 : 1 1 : 1 
PKLLTASLLIDGELYSGTAANFMGRDFAI FRTLGHHHPIRTEQHDSRWLNDPRFISAHLI 


225 


Db 


67 


126 


Qy 


226 


VDYGDYIYFFFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARL 

: 1 1 M M 1 : : | 1 : 1 : 1 1 1 1 1 1 I I : I 1 : 1 1 1 1 1 1 
PESDNPEDDKVYFFFRENAIDGEHSGKATHARIGQICKNDFGG-HRSLVNKWTTFLKARL 


281 


Db 


127 


185 


Qy 


282 


NCSVPG DSHFYFNI LQAVTDVI RINGRD WLAT FS T P YN S I P G S AVC AYDML 

INN MM : II :| :| :| hi 1 Mill 1 1 
ICSVPGPNGIDTHF DELQDVFLMNSKDPKNPIVYGVFTTSSNIFKGSAVCMYSMS 


333 


Db 


186 


240 


Qy 


334 


DIASVFTGRFKEQKS PDSTWTPVPDERVPKPRPGCCAGS S SLERYATSNEFPDDTLNFI K 
1 : 1 1 : : 1 : II 1 1 1 1 1 1 1 1 1 : : : : : 1 1 1 : 1 : 
DVRRVLLGPYAHRDGPNYQWVPY-QGRVPYPRPGTCP-SKTFGGFDSTKDLPDDVITFAR 


393 


Db 


241 


298 


Qy 


394 


THPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLAR 


453 


Db 


299 


: 1 1 1 1 1 1 1 1 : : 1 1 1 : | : | | | : | : | : | : : | : | | : : 

SHPAMYNPVFPINNRPIMIKTDVNYQFTQIWDRVDAEDGQYDVMFIGTDVGTVLKWSV 


358 


Qy 


454 


IGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLG 


513 


Db 


359 


= : : 1 1 1 1 : 1 : 1 1 I : I I I : : I : : | | 
PKETWHDLEEVLLEEMTVFR EPTTISAMELSTKQQQLYIGSTAGVAQLPLH 


409 


Qy 


514 


RCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNS-RLTFEQDIERGNTDGLGDCHNSF 
1 1 : : 1 1 1 : M 1 1 1 1 : 1 : 1 1 1 : I I III | [ 1 | : 
RCDIYGKACAECCLARDPYCAW — DGSSCSRYFPTAKRRTRRQDIRNG — DPLTHCSDLQ 


572 


Db 


410 


465 




O / O 


VALNbtibb^L LPbi I rSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQ 

1 1 1 1 : 1 : : : | : : : | : : | : : : | : 
HHDNHHGPSLEERIIYGVENSSTFLECSPKSQRALVYWQFQRRNEDRKEEI-KMGDHIIR 


628 


Db 


466 


524 


Qy 


629 


DKKGVI RES YLK 640 




Db 


525 


: : 1 : : 1 1 

TEQGLLLRSLQK 536 





RESULT 11 
T27165 

hypothetical protein Y54E5B.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Accession: T27165 
R;Lennard, N. 

submitted to the EMBL Data Library, October 1998 
A; Reference number: Z20321 
A; Accession: T27165 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-712 <WIL> 

A; Cross-references: EMBL : AL032653 ; PIDN: CAA21714 . 1; GSPDB : GN00019; CESP : Y54E5B . 1 

A; Experimental source: clone Y54E5B 

C; Genetics : 

A; Gene: CESP : Y54E5B . 1 

A; Map position: 1 



Introns: 36/1; 70/2; 85/3; 194/2; 221/1; 253/3; 301/1; 352/3; 522/2; 560/1; 
9/3; 655/3 

Query Match 14.5%; Score 789; DB 2; Length 712; 

Best Local Similarity 28.7%; Pred. No. 9.1e-45; 

Matches 229; Conservative 129; Mismatches 282; Indels 158; Gaps 29 

6 LLLYFTLLHFAGAGFPEDSEPI SI SHGNYTKQYP VFVGHKPGRNTTQRHRLDIQ 59 

Mil:: II I: | | | :| : | | | 

7 LLLLFNW RSSEAIT GGWNLRPKQIINSVGIGDRFGGIGTSSDESDHF 55 

60 MIMIMNG-TLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECH 118 

:: : I : I : I I : : I : : I : : |: || :: I MM I :| 

56 KLLAADGD S LLVGARNAVYNL S L S T LSVNHKIDWKPPAEHIEECIMKGKSKTDCQ 110 

119 NFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFG DEFSGMARCPYDAKHANV 174 

I : I : I I : I : I I I I : I M I II I I : M : | | Ml II : 

111 N YI RVLARKSAGVSLVCGTHAFS PKCREY TVTEFGI RNTRQFDGQGI S P YDPKHNSS 167 

175 ALFADG KLYSATVTDFLAI DAVT YR-SLGESPT LRTVKHDSKWLKEPYFVQA 225 

IM I : I : II M II : I I : I I I : : | : | : : || : I : : I III 

168 AL YVP GTNQLFVATVT DFVGNDAL I YRKT I DET P S S KSAANI RTQ S YDARVLNAPN FVAT 227 

226 VDYGDYIYFFFREIAVEY— NTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNC 283 

I : : : M : M I II I I : : I I I : I M I I I I : I : M I I : I I I II M 

228 FAYKEHVYFWFREIASEAI DNNEEPQIYARVARVCKNDKGGA-RPANERWTSYLKARLNC 286 

284 SVP-GDSHFYFNILQAVTDVIRI-NGRDWLATFSTPYNSIPGSAVCAYDMLDIASVF-T 340 

I : I I I MM I : I I : I I I || MM : : MM : I | | 
287 SLPSGSSPFYFNELKAVSDPIDAGNNNHWYTVFSTPDSDVRMSAVCKFSMKKIREEFDN 34 6 

341 GRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDE 400 

I M I : II I II I I I I I : I : : | : : | : : | I MM 

347 GT FKHQNNAQ SMWMAFNRNEVP KPRP GS C S P D S T KLPENTVSFILHHPLLHR 398 

4 01 AVPSIFNRPWFLRTMVRYRLTKIAV DTAAGPYQNHTWFLGSEKGIILKFLARIGNS 457 

Ml: I : I MM I I I : M ::|:|: I Ml : I I : 

399 PIP S V- AAP L LVE GAD RADLT Q I T VL P RVRAVGGH - NYDILFIGTSDG KVL KWEVD GNA 456 

458 GFLND-SLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCE 516 

: :M : : I | |: ; : : :|: | 

457 T VI Q S AT VFQ RGVP I VN LLTTKESWIVSADEIASLPVHNCA 498 

517 RHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALN 576 

: I I I: MIM I II : | |:: | :: | : 

499 QQTSCSK-CVQLQDPHCAWDSSIARCVHGGSWTGDQFIQNMVFGQSE QCPEGIIV— 552 

577 GHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RE 636 

■ : : I : I III I 
553 REVFDDNESEAQPEAVS RS 571 

637 SYLKGHDQLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHS-- 694 

I I I : I : I M : : : : M I I II I : I : | : | 

572 GYPKEHSTI T WLVAAVAS L I S L 1 1 GA- F I G I RV NRWAATSEPHRSASSTSGSDY 625 

695 RRGSMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKA 745 



Db 



626 DSFGRARLTRHDSLTTATKVDHGF VPQ S KQ S VDAT S LVMS I NA 668 



Qy 74 6 DQHHLDLTALPTPESTPT 763 

I : :: : :||: 
Db 669 THHPMSMSQHGSGINTPS 686 

RESULT 12 
C49423 

semaphorin II precursor - fruit fly (Drosophila melanogaster ) 
C; Species: Drosophila melanogaster 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text change 07-May-1999 
C; Accession: C49423 

R;Kolodkin, A.L.; Matthes, D.J.; Goodman, C.S. 
Cell 75, 1389-1399, 1993 

A; Title: The Semaphorin genes encode a family of transmembrane and secreted 
growth cone guidance molecules. 

A; Reference number: A49423; MUID : 94 094332 ; PMID: 8269517 
A; Accession: C49423 

A; Status: preliminary; nucleic acid sequence not shown; not compared with 

conceptual translation 

A; Molecule type: mRNA 

A; Residues: 1-724 <KOL> 

A;Cross-references : GB:L26083 

C; Genetics : 

A; Gene: sema II 

A; Cross-references : FlyBase : FBgn0011260 
C; Super family : semaphorin 

Query Match 14.3%; Score 779.5; DB 2; Length 724; 

Best Local Similarity 33.5%; Pred. No. 4.1e-44; 

Matches 193; Conservative 99; Mismatches 199; Ihdels 85; Gaps 22; 

QY 33 NYTKQYPVFVGHKPGRNTTQRHRLD IQMIMIMN GTLYIAARDHIYTV 7 9 

I 5 : I 1= I I -'II II III: I I :: I 

Db 32 NFYYERPCCTGNDQGNNNYGKHGADHVREFNCGKLYYRTFHMNEDRDTLYVGAMDRVFRV 91 

^ 80 DIDTSHTEEIYCSK-KLTWKSRQADVDTCRMKGKHK-DECHNFIKVLLKKND-DALFVCG 136 

= = : l = : : : : II :| III : :| I ::|: : I I : I I I 

Db 92 NLQNISSSN— CNRDAINLEPTRDDWSCVSKGKSQIFDCKNHVRVIQSMDQGDRLYVCG 14 9 

Qy 137 TNAFNPSCRNY KMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I = = I : : : I I : I : M I I : | : : : 

Db 1^0 TNAHNP KDYVIYANLTHLPRSEYVI GVG LGIAKCPYDPLDNSTAI YVENGNPGGL 204 

Qy 181 -KLYSATVTDFLAIDAVIYRS LGESPTLRTVKHDSKWLKEPYFVQAVDYGDY 231 

I I I I : I I I I : I : I I I : I : I I I I I : I I I : I I : I 

Db 205 PGLYSGTNAEFTKADTVIFRTDLYNTS7VKRLEYKFKRTLKYDSKWLDKPNFVGSFDIGEY 264 

Qy 232 IYFFFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHF 291 

: I I I I I I I I I I I I I : I : I : I I I I : I I : : | I : : I I I I I I I I : I : I 
Db 265 WFFFRETAVEYINCGKAVYSRI7VRVCKKDVGG-KNLLAHNWATYLKARLNCSISGEFPF 323 

QY 292 YFNILQAVTDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDS 351 

III =1=1 = I 111:1 I : Mill:: : :| : I hllll I :| 

Db 324 YFNEIQSVYQLPSDKSR--FFATFTTSTNGLIGSAVCSFHINEIQAAFNGKFKEQSSSNS 381 



Qy 


352 


Db 


382 


Qy 


411 


Db 


434 


Qy 


469 


Db 


491 


Qy 


528 


Db 


537 



11 = 111:11111 : I II I I I I :: I I I I I : I I I I : 

■PVLNSRVPEPRPGTCVNDTS NLPDTVLNFIRSHPLMDKAVNHEHNNPVY 433 



I =1 M II : I I : I ::|: | | | : | |: | : | 

[RDLVFTKL WDKI RI DI L NQEYIVYYVGTNLGRIYKIVQYYRNGESLSKLLDIFE 490 



I I:: : III: : :: | | |: | : |: 

-PNEAIQVMEISQTRKSLYIGTDHRIKQIDLAMCNRRYDNCFR-CV- 536 



I I I I I I I I I I | || 



RESULT 13 
G02173 

semaphorin III family hornolog - human 
C; Species: Homo sapiens (man) 

C;Date: 21-Dec-1996 #sequence_revision 06-Jun-1997 #text_change 24-Sep-1999 
C /Accession: G02173 
R;Naylor, S. 

submitted to the EMBL Data Library, October 1995 
A; Reference number: G09275 
A; Accession: G02173 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-753 <NAY> 

A;Cross-references: EMBL:U38276; NID; gl061350; PIDN : AAB18276 . 1 ; PID:gl061351 
C; Superfamily : semaphorin 

Query Match 14.1%; Score 768; DB 2; Length 753; 

Best Local Similarity 31.6%; Pred. No. 2.5e-43; 

Matches 191; Conservative 109; Mismatches 207; Indels 98; Gaps 22; 

QY 6 LLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMI 61 

III: :ll I II I I I I II : : 

Db 6 LLLWASLLTGAWPSFPTQD HLP ATPRVRLSFKELKATG 43 

QY 6 2 MIMNGT LYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQA 102 

: : I I : I : : : | : : : : | : : | : : | : 

Db 44 TAHFFNFLLNTTDYRILLKDEDHDRMYVGSKDYVLSLDLHDINREPLI IHWAASPQ 99 

Qy 103 DVDTCRMKGKH-KDECHNFIKVLLKKNDDALFVCGTNAFNPSC RNYKMDTL 152 

: : I : I I MM:::: I I : I II I I : I I I : : I : I 

Db 100 RIEECVLSGKDWGECGNFVRLIQPWNRTHLYVCGTGAYNPMCTYVNRGRRAQDY-IFYL 158 

Qy 153 EPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKH 212 

II Ml Mill I : : :||: ||: || |:|:||: :|| :: 

Db 159 EPERLE-SGKGKCPYDPKLDTASALINEELYAGVYIDFMGTDAAI FRTLGKQTAMRTDQY 217 

Qy 213 DSKWLKEPYFVQA VT)YGDYIYFFFREIAVEYNTMGKWFPRVAQVCKNDMGGSQ 2 66 

M M I : I I : I : I : I I I I I I : I I : I : :: I I I I I 

Db 218 NSRWLNDPS FIHAELI PDSAENDDKLYFFFRERSAE-APQS PAVYARI GRI CLNDDGGHC 276 



Qy 267 RVLEKQWTSFLKARLNCSVPGDS — HFYFNILQAV TDVI RINGRDWLAT FST P YN 320 

: : I I : : I I I I M I I I I I : : I : I I I I : I I : I I : : : 

Db 277 CLVNK-WSTFLKARLVCSVPGEDGIETHFDELQDVFVQQTQDVR NPVIYAVFTSSGS 332 

Qy 321 SIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYAT 380 

I I I I I I I I I I I I I : : I : II : : I I I I I I I : : 

Db 333 VFRGSAVCVYSMADIRMVFNGPFAHKEGPNYQWMPF-SGKMPYPRPGTCPGGTFTPSMKS 391 

Qy 381 SNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIA VDTAAGPYQNHTV 437 

: :: I I : : I I : : : I I II : II : II : I I I I I I I I M : I I I : I 
Db 392 TKDYPDEVINFMRSHPLMYQAVYPLQRRPLWRTGAPYRLTTIAVDQVDSADGRYE V 448 

Qy 438 VFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASS 4 97 

: I I I : : : I : I : : : I I I I : I : : : I : 

Db 44 9 LFLGTDRGTVQKVIV-LPKDDQEMEELMLEEVEVFK D PAP VKTMT I S S KRQ 498 

Qy 498 SLYVAFSTCVT KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNS-RLTFEQD 556 

I I I I : I : I I I : : I I : I I I I I I : I I I I : : I I : II 

Db 499 QLYVASAVGVTHLSLHRCQAYGAACADCCLARDPYCAW — DGQACSRYTASSKRRSRRQD 556 

Qy 557 IERGN 561 

: I I 

Db 557 VRHGN 561 



RESULT 14 
148748 

semaphorin E - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 24-Sep-1999 
C; Accession: 148748 

R;Puschel, A.W. ; Adams, R.H.; Betz, H. 
Neuron 14, 941-948, 1995 

A; Title: Murine semaphorin D/collapsin is a member of a diverse gene family and 
creates domains inhibitory for axonal extension. 
A/Reference number: 148744; MUID: 95267431; PMID:7748561 
A;Accession: 148748 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-7 51 <RES> 

A;Cross-references: EMBL: X85994 ; NID:g854331; PIDN : CAA59986 . 1 ; PID:g854332 
C; Genetics: 
A; Gene: semE 

C; Superf amily: semaphorin 

Query Match 13.5%; Score 737; DB 2; Length 751; 

Best Local Similarity 31.3%; Pred. No. 3e-41; 

Matches 187; Conservative 103; Mismatches 234; Indels 74; Gaps 18; 
Qy 24 SEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMI — MNGTLYIAARDHIYTVDI 81 



Db 42 SEYFSLSH QQLDYRILLMDEDQDRIYVGSKDHILSLNI 79 

Qy 82 DTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDE-CHNFIKVLLKKNDDALFVCGTNAF 140 

: I : : I : I : I : I I I I I I : : I : I I : I I I : I I 

Db 8 0 NNISQEPL SVFWPAST I KVEECKMAGKDPTHGCGNFVRVI QTFNRTHLYVCGS GAF 135 



Qy 


141 


NPSC RNYKMDTLEPFGDEF SGMARCPYDAKHANVALFADGKLYSATVT 

: M 1 : : |: MM:: |:: : :|:| 

S PVCT YLNRGRRS E DQVFMIDSKCESGKGRCSFNPNVNTVSVMINEELFSGMYI 


188 


Db 


136 


189 


Qy 


189 


DFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQA VDYGD YIYFFFREIAV 
H= M 1:111 : III :|:|||| II II 1 I I :||||:| 
DFMGTDAAIFRSLTKRMQLRTDQHNSKWLSEPMFVDAHVIPDGTDPNDAKVYFFFKERLT 


241 


Db 


190 


249 


Qy 


242 


EYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDS--HFYFNILQAV 
: 1 1 : M:M II II II | :||:|||[|| III : :|: |: I 
DNNRSTKQIHSMIARICPNDTGG-QRSLVNKWTTFLKARLVCSVTDEDGPETHFDELEDV 


299 


Db 


250 


308 


Qy 


300 


TDVIRINGR-DWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPD 


358 


Db 


309 


: II : 1 Ml : MIM 1 : II Ml 1 1 :: M 
FLLETDNPRTTLVYGIFTTSSSVFKGSAVCVYHLSDIQTVFNGPFAHKEGPNHQLISY-Q 


367 


Qy 


359 


ERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRY 
1 M 1 1 1 1 1 1 : M M M 1 : 1 1 : M II : : 1 I I : | | 
GRIPYPRPGTCPGGAFTPNMRTTKDFPDDWTFIRNHPLMYNSISPIHRRPLIVRIGTDY 


418 


Db 


368 


427 


Qy 


419 


RLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCS 
: IMIM : |:|||:::| : | : || : I |||: |: : 
KYTKIAVDRWAADGRYHVLFLGTDRGTVQKVVVLPTNSS-ASGELILEELEVFKNH 


478 


Db 


428 


483 


Qy 


479 


YDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKE 
1 M ; MM: IMIM M 1 : M II 1 1 : 
VP I TTMEI S S KKQQLYVS SNEGVSQVS LHRCHI YGTACADCCLARDP YCAW- - D 


538 


Db 


484 


535 


Qy 


539 


GGACSHLSP-NSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQE 595 
1 Ml 1 I : ||: || || : ::: :: ::|| | 


Db 


536 


GHSCSRFYPTGKRRSRRQDVRHGNP— LTQCRGFNLKAYRNAAEIVQYGVRNNSTFLE 591 



RESULT 15 
S66498 

M-sema F protein precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 28-Oct-1996 #sequence_revision 13-Mar-1997 #text_change 24-Nov-1999 
C; Accession: S 664 9 8 

R;Inagaki, S.; Furuyama, T . ; Iwahashi, Y. 
FEBS Lett. 370, 269-272, 1995 

A; Title: Identification of a member of mouse semaphorin family. 

A; Reference number: S66498; MUID: 95385809 ; PMID: 7656991 

A; Accession: S 664 98 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-834 <INA> 

A; Cross-references: EMBL:S79463; NID : glll0598 ; PIDN : AAB35184 . 1 ; PID:glll0599 
C; Superfamily : semaphorin 

F; 1-2 1/Domain : signal sequence #status predicted <SIG> 
F;22~834/Product : M-sema F protein #status predicted <MAT> 

Query Match 12.7%; Score 692; DB 2; Length 834; 

Best Local Similarity 26.0%; Pred. No. 3.7e-38; 

Matches 232; Conservative 131; Mismatches 326; Indels 204; Gaps 34; 



^ 65 NGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGK-HKDECHNFIKV 123 



Db 


62 


:| ||: ||: :: : I : ::| :: 1 III :: II III: 

S GLLYVGAREALFAFSV EALELQGAI SWEAPAEKKI ECTQKGKSNQTEC FNFI RF 1 1 6 


Qy 


124 


LLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEF-SGMARCPYDAKHANVALFADGKL 
1 1 l:MM II 1 1 || M| :|||, : , ,|:) 
LQPYNSSHLYVCGTYAFQPKCTYINMLTFTLDRAEFEDGKGKCPYDPAKGHTGLLVDGEL 


182 


Db 


117 


176 


Qy 


183 


YSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQA VDYGDYI 
1111= -II : II 1 :l :::| :: : II ||:|| : | | 
YSATLNNFLGTEPVILRYMGTHHSIKT-EYLAFWLNEPHFVGSAFVPESVGSFTGDDDKI 


232 


Db 


177 


235 


Qy 


233 


YFFFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFY 

1 : 1 |||:| |: | |:|:||:||IMI II 1 1 1 

YFFFSERAVEYDCYSEQWARVARVCKGDMGGA-RTLQKKWTTFLKARLVCSAP-DWKVY 


292 


Db 


236 


293 


Qy 


293 


FNILQAVTDVIRINGRDW-LATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDS 
1 1 1 : 1 1 : : = 1 : : Mill: I I I I : 1 I 
FNQLKAVHTLRGASWHNTT FFGVFQARWGDMDLSAVCE YQLEQI QQVFEGP YKEYS EQAQ 


351 


Db 


294 


353 


Qy 


352 


TWTPVPDERVPKPRPGCCAGSSSLER-YATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPW 

1 1 1 1 1 1 1 1 1 : : 1 : 1 1 1 1 : 1 1 1 1 1 1 II 1 1 :: 1 I I 
KWARYTDP-VPSPRPGSCINNWHRDNGYTSSLELPDNTLNFIKKHPLMEDQVKPRLGRPL 


410 


Db 


354 


412 


Qy 


411 


FLRTMVRYRLTKIAVDTAAG-PYQNHTWFLGSEKGIILKFLARIGNSGFLNDSL 
: : : 1 : | | : 1 1 : 1 : 1 : 1 : | | : | | 
LVKKNTNF— THWADRVPGLDGATYTVLFIGTGDGWLLKAV SLGPWIH 


464 


Db 


413 


459 


Qy 


465 


FLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKT 


524 


Db 


460 


=11= l:= 1 : : | :: |: : :::: | | :: | 
MVEELQVFDQEP VESLVLSQSKKVLFAGSRSQLVQLSLADCTKYRFC-VD 


508 


Qy 


525 


CIASRDPYCGWIKEGGAC SHLS — PNSRLTFEQDIER 

1= UIMI 1 1 |:: |:: : | s: 
CVLARDPYCAWNVNTSRCVATTSGRSGSFLVQHVANLDTSKMCNQYGIKKVRS.IPKNITV 


559 


Db 


509 


568 


Qy 


560 


-GNTDGLGDCHNSFVALNGH-- -SSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDS 

M ■ 1 1 1 : 1 III: | : : | 
VSGTDLVLPCHLSSNLAHAHWTFGSQDLPAEQPGSFLYDTGLQALWM 


615 


Db 


569 


616 


Qy 


616 


TDPLGAVSSHN HQDKKG -VI RESYLKG HDQLVP VTLLAI AVI LAFV 

III: : : : : | : | | | | : | : | : : 
AAQSRHSGPYRCYSEEQGTRLAAESYLVAWAGSSVTLEARAPLENLGLVWLAWA 


660 


Db 


617 


672 


Qy 


661 


MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 
= MI : : 1 ||: : | | : | : : | 


720 


Db 


673 


LGAVCL-VLLLLVLSLRRR LREELE KGAKAS ER 


704 


Qy 


721 


AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREW 


777 


Db 


705 


: = N II ::| : I : | 


735 


Qy 


778 


ERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQ 
= = == = 1 1 1 1 : 1 : I I | : : | : : || 
GYYYSDGSLKIVPGHARCQPGGGPPSPPPGIPGQPLPS — PTRLHLGGGRNSNANGYVR- 


832 


Db 


736 


792 


Qy 


833 


PKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQRE 885 





I II II I : II: 



Db 7 93 LQLGGEDRGGS 



GHPLPELADELRRKLQQRQ 822 



Search completed: March 24, 2004, 13:17:10 
Job time : 33.8439 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: March 24, 2004, 13:14:29 ; Search time 74.7731 Seconds 

(without alignments) 
3567.110 Million cell updates/sec 

Title: US-09-856-681A-2 
Perfect score: 5450 

Sequence: 1 MRS EAL L L Y FT L LH FAGAG F PPKPSFAPLSTSMKPNDACT 1030 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1049977 seqs, 258955339 residues 

Total number of hits satisfying chosen parameters: 1049977 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA: * 

1: / cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep: * 

2 : / cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB . pep : * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB . pep : * 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 

5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB . pep : * 

6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep: * 

7 : /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB . pep : * 

8 : / cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB . pep : * 

9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep: * 
10 : /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB . pep : * 
11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: * 
12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep : * 
13 : /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep: * 
14 : /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep: * 
15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep: * 
16 : /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB . pep : * 
17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: * 
18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 

No. Score Match Length DB ID Description 
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100 
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12 
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3 


5431. 5 


99 


. 7 


1047 


15 
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-48 


Sequence 


48, 
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4 
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99 


.5 


1047 


10 
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-09 
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-187 
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Sequence 
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5 


5422.5 


99 


.5 


1047 


12 
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-10 


-403- 


-676 
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Sequence 


14, 


Appl 


6 


5422. 5 


99 


5 


1047 


15 


us- 


-10 


-449- 


-548 


-14 


Sequence 


14, 


Appl 


7 


5368 


98 


5 


1018 


12 


us- 


-10- 


-403- 


-676 


-28 


Sequence 


28, 


Appl 


8 


5368 


98 


5 


1018 


15 


us- 


-10- 


-449- 


-548 


-28 


Sequence 


28, 


Appl 


9 


5349.5 


98 


2 


1035 


12 


us- 


-10- 


-403- 


-676 


-18 


Sequence 


18, 


Appl 


10 


5349.5 


98 


2 


1035 


15 


us- 


-10- 


-449- 


-548- 


-18 


Sequence 


18, 


Appl 


11 


5134.5 


94. 


2 


998 


12 


us- 


-10- 


-403- 


-676- 


-20 


Sequence 


20, 


Appl 


12 


5134.5 


94. 


2 


998 


15 


us- 


-10- 


-449- 


-548- 


-20 


Sequence 


20, 


Appl 


13 


5113.5 


93. 


8 


981 


12 


us- 


-10- 


-403- 


-676- 


-30 


Sequence 


30, 


Appl 


14 


5113.5 


93. 


8 


981 


15 


us- 


-10- 


-449- 


-548- 


-30 


Sequence 


30, 


Appl 


15 


5109.5 


93. 


8 


971 


12 


us- 


-10- 


-403- 


-676- 


-46 


Sequence 


46, 


Appl 


16 


5109. 5 


93. 


8 


971 


15 


us- 


-10- 


-449- 


-548- 


-46 


Sequence 


46, 


Appl 


17 


4921 


90. 


3 


939 


10 


us- 


-09- 


-957- 


-187- 


-4 


Sequence 


4, 


Appli 


18 


4921 


90. 


3 


939 


12 


us- 


-10- 


-403- 


-676- 


-32 


Sequence 


32, 


Appl 


19 


4921 


90. 


3 


939 


15 


us- 


-10- 


-449- 


-548- 


-32 


Sequence 


32, 


Appl 


20 


4590.5 


84. 


2 


884 


10 


us- 


-09- 


-957- 


-187- 


-6 


Sequence 


6, < 


Appli 


21 


4346.5 


79. 


8 


888 


12 


us- 


-10- 


-016- 


-248- 


-64 


Sequence 


64, 


Appl 


22 


3373 


61. 


9 


630 


10 


us- 


-09- 


-957- 


-187- 


-30 


Sequence 


30, 


Appl 


23 


3368 


61. 


8 


861 


12 


us- 


-10- 


-403- 


-676- 


-52 


Sequence 


52, 


Appl 


24 


3368 


61. 


8 


861 


15 


us- 


-10- 


-449- 


-548- 


-52 


Sequence 


52, 


Appl 


25 


3367.5 


61. 


8 


636 


12 


us- 


10- 


-403- 


-676- 


-36 


Sequence 


36, 


Appl 


26 


3367.5 


61. 


8 


636 


15 


us- 


-10- 


-449- 


548- 


-36 


Sequence 


36, 


Appl 


27 


3363 


61. 


7 


666 


12 


us- 


-10- 


-403- 


676- 


-56 


Sequence 


56, 


Appl 


28 


3363 


61. 


7 


666 


15 


us- 


-10- 


-449- 


548- 


-56 


Sequence 


56, 


Appl 


29 


3362 


61. 


7 


62 6 


10 


us- 


-09- 


-957- 


187- 


-83 


Sequence 


83, 


Appl 


30 


3362 


61. 


7 


62 6 


12 


us- 


10- 


-403- 


676- 


-34 


Sequence 


34, 


Appl 


31 


3362 


61. 


7 


62 6 


15 


us- 


10- 


-449- 


548- 


-34 


Sequence 


34, 


Appl 


32 


3351.5 


61. 


5 


649 


12 


us- 


10- 


-403- 


676- 


-50 


Sequence 


50, 


Appl 


33 


3351. 5 


61. 


5 


64 9 


15 


us- 


10- 


-449- 


548- 


-50 


Sequence 


50, 


Appl 


34 


3349.5 


61. 


5 


878 


12 


us- 


10- 


-403- 


676- 


-54 


Sequence 


54, 


Appl 


35 


3349.5 


61. 


5 


878 


15 


us- 


10- 


-449- 


548- 


-54 


Sequence 


54, 


Appl 


36 


3285.5 


60. 


3 


640 


12 


us- 


10- 


•403- 


676- 


-16 


Sequence 


16, 


Appl 


37 


3285.5 


60. 


3 


640 


15 


us- 


10- 


-449- 


548- 


-16 


Sequence 


16, 


Appl 


38 


2627 


48. 


2 


497 


12 


us- 


10- 


-403- 


676- 


-24 


Sequence 


24, 


Appl 


39 


2627 


48. 


2 


497 


15 


us- 


10- 


-449- 


548- 


-24 


Sequence 


24, 


Appl 


40 


2545 


46. 


7 


479 


12 


us- 


10- 


403- 


676- 


-26 


Sequence 


26, 


Appl 


41 


2545 


46. 


7 


479 


15 


us- 


10- 


449- 


548- 


-26 


Sequence 


26, 


Appl 


42 


2349.5 


43. 


1 


1088 


12 


us- 


10- 


016- 


248- 


-20 


Sequence 


20, 


Appl 


43 


2335 


42. 


8 


1035 


12 


us- 


10- 


016- 


248- 


12 


Sequence 


12, 


Appl 


44 


2330 


42. 


8 


442 


12 


us- 


10- 


403- 


676- 


22 


Sequence 


22, 


Appl 


45 


2330 


42. 


8 


442 


15 


us- 


10- 


449- 


548- 


22 


Sequence 


22, 


Appl 



ALIGNMENTS 



RESULT 1 

US-10-016-248-63 

; Sequence 63, Application US/10016248 

; Publication No. US20040033491A1 

; GENERAL INFORMATION: 

; APPLICANT: Alsobrook et al. 



; TITLE OF INVENTION: Proteins and Nucleic Acids Encoding Same 
; FILE REFERENCE: 21402-218 

; CURRENT APPLICATION NUMBER: US/10/016,248 
; CURRENT FILING DATE: 2002-09-20 

PRIOR APPLICATION NUMBER: 60/254,32 9 
; PRIOR FILING DATE: 2000-12-08 
; PRIOR APPLICATION NUMBER: 60/291,037 
; PRIOR FILING DATE: 2001-05-15 
; PRIOR APPLICATION NUMBER: 60/255,64 8 

PRIOR FILING DATE: 2000-12-14 
; PRIOR APPLICATION NUMBER: 60/297,173 
; PRIOR FILING DATE: 2001-06-08 
; PRIOR APPLICATION NUMBER: 60/309,258 

PRIOR FILING DATE: 2001-07-31 
; PRIOR APPLICATION NUMBER: 60/326,393 
; PRIOR FILING DATE: 2001-10-01 
; PRIOR APPLICATION NUMBER: 60/315,639 

PRIOR FILING DATE: 2001-08-29 
; NUMBER OF SEQ ID NOS : 167 
; SOFTWARE: PatentlnVer. 2.1 
; SEQ ID NO 63 

LENGTH: 1030 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-016-248-63 

Query Match 100.0%; Score 5450; DB 12; Length 1030; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1030; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | 

Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I > I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | I I I I I I I I I I I I | | 
Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 12 0 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 18 0 

I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 24 0 

> I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

°b 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 24 0 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVXEKQWTSFLK7VRLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | M I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I M I I I I I I I I II I I I I I I I I I I I I I | | | | | | | | | | | | || | | | | | || || | | | | | | | | | | | 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 



Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I 1 I I I I II I M I M M I I I 1 I I I I I I I M 1 I I ! M I I I I I I I I I I I I I I I I I I I I M 
Db 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I I I I I I I M I II I I I I I I M I II I I I I I I I I I I I I II I I I I I I I I I I I I II I I II I M I 
Db 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 54 0 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I M I I I I I M II i I II I I I I I I I I I I II I I I I I M I I I M I I I I M II I I I I I I II I 
Db 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVILAFV 660 

I I M I I I II II I I I M I M I M M I II M I II I I I II II II I I II I I II I I II I I II M I 

Db 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVILAFV 660 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I M M M M M I I M I M I M II M I M II I I M I M I M M I I I I I I I I II II I! I M I 

Db 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I I I I I I ! I I I !! 1 I ! I ! I II I II I I I I I I I I I I I I I ! I I I I I I I I !!! I I I I I M M I I ! 
Db 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

I I II II I I I I I II II II II I I I I I I I I I I I M I I II I I I II I II M I M I I I! I I II I I I 

Db 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

II I I I M I II I I I I M I I II I I I II II M II I I M I I I I I I II I I II II I M I I I I I I I I 

Db 841 MALEDQAATLEYKTIKEHLSSKSPNHGWLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

II I I I I I I I I I II I I I I I I I I I I I I II I I I M I I I I I I I I I I I II I I I II I I I I I II I I I 
Db 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

Qy 961 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 1020 

II I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 961 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 1020 

Qy 1021 TSMKPNDACT 1030 

II I I I II I II 
Db 1021 TSMKPNDACT 1030 



RESULT 2 
US-10-403-676 
Sequence 48 



■48 



Application US/10403676 
Publication No. US20040029150A1 
GENERAL INFORMATION: 
APPLICANT: Alsobrook II, John 
APPLICANT: Anderson, David W. 
APPLICANT: Boldog, Ferenc L . 
APPLICANT: Burgess, Catherine E. 
APPLICANT: Casman, Stacie J. 
APPLICANT: Edinger, Shlomit R. 



APPLICANT : 


Gerlach, 


Valerie L. 


APPLICANT : 


Grosse, 


William M. 


APPLICANT : 


Guo, Xiaojia 


APPLICANT : 


Gusev, Vladimir Y. 


APPLICANT: 


Ji, Weiz 


hen 


APPLICANT: 


LaRochelle, William J. 


APPLICANT : 


Lepley, 


Denise M. 


APPLICANT : 


Li, Li 




APPLICANT : 


Liu, Xia 


ohong 


APPLICANT : 


MacDougall, John R. 


APPLICANT • 

<J LL X XJ _J_ V^*JlX X * 


Malyanka 


r, Uriel M. 


APPLICANT : 


Millet, 


Isabelle 


APPLICANT : 


Padigaru 


, Muralidhara 


APPLICANT • 

■*1*J- X XJ _l_ OiUl X * 


Pattura j 


an, Meera 


APPLICANT • 

ill X XJ _L Oiiii) X * 


Peyman, 


John A. 


APPLICANT: 


Rastelli 


, Luca 


APPLICANT • 

ill J- XJ _1_ VyJiJ, i| X * 


Reiger , 


Daniel 


APPLICANT: 


Rothenbe 


rg, Mark E. 


APPLICANT: 


Shimkets 


, Richard A. 


APPLICANT: 


Stone, David J. 


APPLICANT: 


Taupier, 


Raymond J. 


APPLICANT: 


Vernet, 


Corine 


APPLICANT: 


Zerhusen 


, Bryan D. 



; TITLE OF INVENTION: THERAPEUTIC POLYPEPTIDES, NUCLEIC ACIDS ENCODING SAME, 

AND METHODS OF USE 

; FILE REFERENCE: 21402-573B 

; CURRENT APPLICATION NUMBER: US/10/403,676 

; CURRENT FILING DATE: 2003-03-31 

; PRIOR APPLICATION NUMBER: 60/123,667 

; PRIOR FILING DATE: 1999-03-09 

; PRIOR APPLICATION NUMBER: 09/520,781 

; PRIOR FILING DATE: 2000-03-08 

; PRIOR APPLICATION NUMBER: 09/957,187 

; PRIOR FILING DATE: 2001-09-19 

; PRIOR APPLICATION NUMBER: 60/371,002 

; PRIOR FILING DATE: 2002-04-09 

; PRIOR APPLICATION NUMBER: 60/127,352 

; PRIOR FILING DATE: 1999-04-01 

; PRIOR APPLICATION NUMBER: 09/538,092 

; PRIOR FILING DATE: 2000-03-29 

; PRIOR APPLICATION NUMBER: 09/604,286 

; PRIOR FILING DATE: 2000-06-22 

; PRIOR APPLICATION NUMBER: 60/140,584 

; PRIOR FILING DATE: 1999-06-23 

; PRIOR APPLICATION NUMBER: 60/370,381 

; PRIOR FILING DATE: 2002-04-05 

; PRIOR APPLICATION NUMBER: 60/384,297 

; PRIOR FILING DATE: 2002-05-30 

; Remaining Prior Application data removed - See File Wrapper or PALM. 

; NUMBER OF SEQ ID NOS : 17 9 

; SOFTWARE: CuraSeqList version 0.1 

; SEQ ID NO 4 8 

LENGTH: 1047 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-403-676-48 



Query Match 99.7%; Score 5431.5; DB 12; Length 1047; 

Best Local Similarity 98.4%; Pred. No. 0; 

Matches 1030; Conservative 0; Mismatches 0; Indels 17; Gaps 1; 

1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

N M M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I | I I I I I I I I I I I 

Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

QY 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 12 0 

I I I I I II I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | M | | | | | | | | | | | | | | | | | | 
Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

QY 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 18 0 

I I I I I I I I I I I I I I II | | || | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

QY 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I M I I I I I I I I I I M I I I I I II I I I I I II I I I I I I I I I I I I I I | | | | | | | | | || | 
Db 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 



QY 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I M I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 



Db 241 



QY 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

MINIUM I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

QY 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I N I I I I I I M I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 4 80 

I I I M I I I I I I I II I I I || I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | || | | | | | | | | 
Db 421 TKIAVDTAAGP YQNHTWFLGS EKGI ILKFLARI GNSGFLNDSLFLEEMSVYNSEKCS YD 4 80 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVT KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

N I I I I I I I I I I I I I I I I I I I I I I I I I I II I I | | | | | | | | | | | | | | | | | | || | | | | | | | | 
Db 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALN GHSSSLL 583 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNDISTPLPDNEMSYNTVYGHSSSLL 600 

Qy 584 PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 643 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I | | | | | | | | | 
Db 601 PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 660 

Qy 644 QLVPVTLLAIAVI IAFVMGAVFSGITVYCVCDHRRKDVAVVQRKEKELTHSRRGSMSSVT 703 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I || 

Db 661 QLVPVTLLAIAVI LAFVMGAVFS GI TVYCVCDHRRKDVAWQRKEKELTHS RRGSMS S VT 720 

Qy 704 KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 763 

I N M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 721 KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 780 



Qy 



764 LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVI PTDLPLRASPSHIPSVWLPITQQ 823 



Db 781 LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQ 840 

QY 824 GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 883 

I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I | | | | M II I II 

Db 841 GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGWLVENLDSLPPKVPQ 900 

QY 884 REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 943 

I I I I I I I M II I I I I I I I I I I || | | | | || | | | | | | | | | | M I I I I I I I I I I I I I I I I I I I 
Db 901 REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 960 

QY 944 SSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRT 1003 

I I M I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 961 SSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRT 102 0 

Qy 1004 PSLKPDVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I II I I I I I I I I I I I I I I I I | | | 
Db 1021 PSLKPDVPPKPSFAPLSTSMKPNDACT 1047 



RESULT 3 

US-10-449-548-48 

Sequence 48, Application US/10449548 
Publication No. US20040018 977A1 
GENERAL INFORMATION: 
APPLICANT: Alvarez, Enrique 
APPLICANT: Anderson, David W. 
APPLICANT: Dhanabal, Mohanraj 
APPLICANT: Khramtsov, Nikolai V. 
APPLICANT: LaRochelle, William J. 
APPLICANT : Li, Li 
APPLICANT: Lichens tein, Henri 
APPLICANT: Ooi, Chean Eng 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Shimkets, Richard A. 
APPLICANT: Zhong, Mei 

TITLE OF INVENTION: SEMAPHORIN-LIKE PROTEINS AND METHODS OF USING SAME 
FILE REFERENCE: 15966-54 0CIP2 
CURRENT APPLICATION NUMBER: US/10/449, 548 
CURRENT FILING DATE: 2003-05-30 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-03 
PRIOR APPLICATION NUMBER: 60/123,667 
PRIOR FILING DATE: 1999-03-09 
PRIOR APPLICATION NUMBER: 60/234,082 
PRIOR FILING DATE: 2000-09-20 
PRIOR APPLICATION NUMBER: 60/233,798 
PRIOR FILING DATE: 2000-09-19 
PRIOR APPLICATION NUMBER: 60/174,485 
PRIOR FILING DATE: 2000-01-04 
PRIOR APPLICATION NUMBER: 10/403,676 
PRIOR FILING DATE: 2003-03-31 
PRIOR APPLICATION NUMBER: 60/371,002 
PRIOR FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: 60/384,798 
PRIOR FILING DATE: 2002-05-30 
PRIOR APPLICATION NUMBER: 60/402,407 



PRIOR FILING DATE: 2002-08-09 
PRIOR APPLICATION NUMBER: 60/443,062 
PRIOR FILING DATE: 2003-01-28 
NUMBER OF SEQ ID NOS : 58 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 48 
LENGTH: 1047 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-449-548-48 

Query Match 99.7%; Score 5431.5; DB 15; Length 1047; 

Best Local Similarity 98.4%; Pred. No. 0; 

Matches 1030; Conservative 0; Mismatches 0; Indels 17; Gaps 1; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQ7VDVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

M I I I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFL7VRIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 4 81 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 54 0 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALN GHSSSLL 583 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNDISTPLPDNEMSYNTVYGHSSSLL 600 



Qy 



584 PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 643 
I I I I I I I II I I I I I II I I I I I I M I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 



Db 


601 


PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHD 


660 


Qy 


644 


QLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 M 1 I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 II 1 1 
QLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVT 


703 


Db 


661 


720 


Qy 


704 


KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 
KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 


763 


Db 


721 


780 


Qy 


764 


LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQ 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQ 


823 


Db 


781 


840 


Qy 


824 


GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 
GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 


883 


Db 


841 


900 


Qy 

Db 


884 
901 


REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 
1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 | | | | 1 1 1 1 1 1 1 1 1 1 1 1 
REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 


943 
960 


Qy 


944 


SSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 i i i i i i i i i i i i i i i i t i i i i 
' 1 1 1 1 1 1 1 1 1 ] 1 1 1 I 1 1 1 I 1 [ 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 J M 1 1 1 M | | | M | 1 

SSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRT 


1003 


Db 


961 


1020 


Qy 


1004 


PSLKPDVPPKPSFAPLSTSMKPNDACT 1030 
1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
PSLKPDVPPKPSFAPLSTSMKPNDACT 1047 




Db 


1021 





RESULT 4 

US-09-957-187-85 

; Sequence 85, Application US/09957187 

; Publication No. US20030054514A1 

; GENERAL INFORMATION: 

; APPLICANT: Shimkets, Richard A. 

; APPLICANT: LaRochelle, William 

; TITLE OF INVENTION: NOVEL POLYNUCLEOTIDES AND PROTEINS ENCODED THEREBY 

; FILE REFERENCE: 15966-540 CIP 

; CURRENT APPLICATION NUMBER: US/09/957,187 

; CURRENT FILING DATE: 2000-09-19 

; PRIOR APPLICATION NUMBER: 60/123,667 

; PRIOR FILING DATE: 1999-03-09 

; PRIOR APPLICATION NUMBER: 09/520,781 

; PRIOR FILING DATE: 2000-03-03 

; PRIOR APPLICATION NUMBER: 60/234,082 

; PRIOR FILING DATE: 2000-09-20 

; PRIOR APPLICATION NUMBER: 60/233,798 

; PRIOR FILING DATE: 2000-09-19 

; PRIOR APPLICATION NUMBER: 60/174,485 

; PRIOR FILING DATE: 2000-01-04 

; NUMBER OF SEQ ID NOS : 85 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 85 

LENGTH: 1047 

TYPE: PRT 



; ORGANISM: Homo sapiens 
US-09-957-187-85 



Query Match 99.5%; Score 5422.5; DB 10; Length 1047; 

Best Local Similarity 98.3%; Pred. No. 0; 

Matches 1029; Conservative 0; Mismatches 1; Indels 17; Gaps 1; 



QY 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I I I M I I I I I I I I I I II I II I I I I I I I II I I | | | | I 

MRSEALLLYFTLLHFAGAGFPEDSEPISISHCNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 



Db 1 



QY 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 12 0 

I I I I I I N I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | 

Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 18 0 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYD7VKHANVALFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I 

Db 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I M I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I | | | | | 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I | | | | | | | | M I I I I I I I I I I I II 

Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I M I I I I I I I I I I II I I I I | | || | | | | | || | || | | | | | | | | | | | | | | | | || | | | | 
Db 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I | | | | | | I I I I I I I I II I 
Db 481 GVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALN GHSSSLL 583 

I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | II I I I I I I I 

D b 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNDISTPLPDNEMSYNTVYGHSSSLL 600 

Qy 584 PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 643 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 660 

Qy 64 4 QLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVT 7 03 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I | I | | | | | | | | | | | 
Db 661 QLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVT 720 

Qy 704 KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 7 63 
I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



721 



KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 



780 



Qy 


764 


Db 


781 


Qy 


824 


Db 


841 


Qy 


884 


Db 


901 


Qy 


944 


Db 


961 


Qy 


1004 


Db 


1021 



LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQ 823 

I I I I I I I I I I I I I > I I I II I I I I I I I I I I I I I I I I I I I I I 

LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSWVLPITQQ 84 0 

GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 883 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 900 

REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 943 
M I I M II I I II I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 960 

SSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRT 1003 

I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I II I I I I II I II I I I I I I I I II I 

SSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRT 1020 

P S LK P D VP P K P S FAP L S T SMK PN DACT 1030 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 
PSLKPDVPPKPSFAPLSTSMKPNDACT 1047 



RESULT 5 

US-10-403-676-14 

Sequence 14, Application US/10403676 

Publication No. US20040029150A1 
GENERAL INFORMATION: 

APPLICANT: Alsobrook II, John 

APPLICANT: Anderson, David W. 

APPLICANT: Boldog, Ferenc L. 

APPLICANT: Burgess, Catherine E. 

APPLICANT: Casman, Stacie J. 

APPLICANT: Edinger, Shlomit R. 

APPLICANT: Gerlach, Valerie L. 

APPLICANT: Grosse, William M. 

APPLICANT: Guo, Xiaojia 

APPLICANT: Gusev, Vladimir Y. 

APPLICANT: Ji, Weizhen 

APPLICANT: LaRochelle, William J. 

APPLICANT: Lepley, Denise M. 

APPLICANT: Li, Li 

APPLICANT: Liu, Xiaohong 

APPLICANT: MacDougall, John R. 

APPLICANT: Malyankar, Uriel M. 

APPLICANT: Millet, Isabelle 

APPLICANT: Padigaru, Muralidhara 

APPLICANT: Patturajan, Meera 

APPLICANT: Peyman, John A. 

APPLICANT: Rastelli, Luca 

APPLICANT: Reiger, Daniel 

APPLICANT: Rothenberg, Mark E. 

APPLICANT: Shimkets, Richard A. 

APPLICANT: Stone, David J. 

APPLICANT: Taupier, Raymond J. 

APPLICANT: Vernet, Corine 

APPLICANT: Zerhusen, Bryan D. 



; TITLE OF INVENTION: THERAPEUTIC POLYPEPTIDES , NUCLEIC ACIDS ENCODING SAME, 

AND METHODS OF USE 

; FILE REFERENCE: 21402-573B 

; CURRENT APPLICATION NUMBER: US/10/4 03,676 

; CURRENT FILING DATE: 2003-03-31 

; PRIOR APPLICATION NUMBER: 60/123,667 

; PRIOR FILING DATE: 1999-03-09 

; PRIOR APPLICATION NUMBER: 09/520,781 

; PRIOR FILING DATE: 2000-03-08 

; PRIOR APPLICATION NUMBER: 09/957,187 

; PRIOR FILING DATE: 2001-09-19 

; PRIOR APPLICATION NUMBER: 60/371,002 

PRIOR FILING DATE: 2002-04-09 
; PRIOR APPLICATION NUMBER: 60/127,352 
; PRIOR FILING DATE: 1999-04-01 

PRIOR APPLICATION NUMBER: 09/538,092 
; PRIOR FILING DATE: 2000-03-29 
; PRIOR APPLICATION NUMBER: 09/604,286 
; PRIOR FILING DATE: 2000-06-22 
; PRIOR APPLICATION NUMBER: 60/140,584 
; PRIOR FILING DATE: 1999-06-23 
; PRIOR APPLICATION NUMBER: 60/370,381 
; PRIOR FILING DATE: 2002-04-05 
; PRIOR APPLICATION NUMBER: 60/384,297 
; PRIOR FILING DATE: 2002-05-30 

; Remaining Prior Application data removed - See File Wrapper or PALM. 

; NUMBER OF SEQ ID NOS : 17 9 

; SOFTWARE: CuraSeqList version 0.1 

; SEQ ID NO 14 

LENGTH: 1047 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-403-676-14 



Query Match 99.5%; Score 5422.5; DB 12; Length 1047; 

Best Local Similarity 98.3%; Pred. No. 0; 

Matches 1029; Conservative 0; Mismatches 1; Indels 17; Gaps 1; 



Qy 


1 


MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 

II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 I I I | | | | | | | 

MRSEALLLYFTLLHFAGAGFPEDSEPISISHCNYTKQYPVFVGHKPGRNTTQRHRLDIQM 


60 


Db 


1 


60 


Qy 


61 


IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | | M | | | | | | | | | ! | | 

IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 


120 


Db 


61 


120 


Qy 


121 


IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 
1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I | | | | | | 
IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 


180 


Db 


121 


180 


Qy 


181 


KLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYI YFFFREIA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 | | | | | | | | | | 

KLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYI YFFFREIA 


240 


Db 


181 


240 


Qy 


241 


VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I | | | | | | | | | | | | 
VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 


300 


Db 


241 


300 



Qy 301 D VI R I NGRD WLAT F S T P YN S I P G S AVC AYDML D I AS VFT GRFKEQKSPDS T WT P VP D E R 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I II I I I I I I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I II 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 4 80 

I II I II II I I I I I I I I I II I I I II I I I I I I I I I I I II II I I I I I I I I I I II I I I II I I I I 
Db 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

Qy 541 ACSHLS PNS RLTFEQDI ERGNTDGLGDCHNS FVALN GHSSSLL 583 

II II M M II I M II I I II I I I I I II II I I M I I M I M I M I 

Db 541 ACSHLS PNSRLTFEQDI ERGNTDGLGDCHNS FVALNDISTPLPDNEMSYNTVYGHSSSLL 600 

Qy 584 PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 643 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 660 

Qy 644 QLVPVTLIAIAVILAFVMGAVFSGITWCVCDHRRKDVAVVQRKEKELTHSRRGSMSSVT 703 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 QLVP VT L LAI AVI LAFVMGAVFS G I T VYCVC DHRRKDVAWQ RKE KE LTH S RRG SMS S VT 720 

Qy " 704 KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTAXPTPESTPT 7 63 

I I I II M M I II I I M II II II M M I M I I I I II I I I I M I I I M I II I I I I I M I II I 

Db 721 KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 780 

Qy 764 LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVI PTDLPLRASPSHIPSVWLPITQQ 823 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 781 LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVI PTDLPLRASPSHIPSVWLPITQQ 84 0 

Qy 824 GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 883 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I II I I I I I I I I I I I I I I I I 

Db 841 GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 900 

Qy 884 REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 943 

I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 901 REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 960 

Qy 944 S S HLS RNQS FGRGDNP P PAPQRVDS I QVHS SQP S GQAVTVS RQP S LNAYNS LT RS GLKRT 1003 

I I I I I I I II I I I I II I II I I II I I I I I I I I I I I I I I I I I I I II II I I I I I II I I I I M II 
Db 961 S S HLS RNQ S FGRGDNP P PAPQRVDS I QVHS SQPS GQAVTVS RQP S LNAYN S LT RS GLKRT 1020 

Qy 1004 PSLKPDVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I I I I I I II I I I I I I I I I II I 
Db 1021 PSLKPDVPPKPSFAPLSTSMKPNDACT 1047 



RESULT 6 

US-10-449-548-14 

; Sequence 14, Application US/10449548 



Publication No. US20040018977A1 
GENERAL INFORMATION: 
APPLICANT: Alvarez, Enrique 
APPLICANT: Anderson, David W. 
APPLICANT: Dhanabal, Mohanraj 
APPLICANT: Khramtsov, Nikolai V. 
APPLICANT: LaRochelle, William J. 
APPLICANT: Li, Li 
APPLICANT: Lichenstein, Henri 
APPLICANT: Ooi, Chean Eng 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Shimkets, Richard A. 
APPLICANT: Zhong, Mei 

TITLE OF INVENTION: SEMAPHORIN-LIKE PROTEINS AND METHODS OF USING SAME 
FILE REFERENCE: 15966-540CIP2 
CURRENT APPLICATION NUMBER: US/10/449,548 
CURRENT FILING DATE: 2003-05-30 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-03 
PRIOR APPLICATION NUMBER: 60/123,667 
PRIOR FILING DATE: 1999-03-09 
PRIOR APPLICATION NUMBER: 60/234,082 
PRIOR FILING DATE: 2000-09-20 
PRIOR APPLICATION NUMBER: 60/233,798 
PRIOR FILING DATE: 2000-09-19 
PRIOR APPLICATION NUMBER: 60/174,485 
PRIOR FILING DATE: 2000-01-04 
PRIOR APPLICATION NUMBER: 10/403,676 
PRIOR FILING DATE: 2003-03-31 
PRIOR APPLICATION NUMBER: 60/371,002 
PRIOR FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: 60/384,798 
PRIOR FILING DATE: 2002-05-30 
PRIOR APPLICATION NUMBER: 60/402,407 
PRIOR FILING DATE: 2002-08-09 
PRIOR APPLICATION NUMBER: 60/443,062 
PRIOR FILING DATE: 2003-01-28 
NUMBER OF SEQ ID NOS : 58 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 14 
LENGTH: 1047 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-449-548-14 

Query Match 99.5%; Score 5422.5; DB 15; Length 1047; 

Best Local Similarity 98.3%; Pred. No. 0; 

Matches 1029; Conservative 0; Mismatches 1; Indels 17; Gaps 1; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHCNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I | | | | | | | | | | | | M I I I I I I I I I I I I I 
Db 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 



Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


584 


Db 


601 


Qy 


644 


Db 


661 


Qy 


704 


Db 


721 


Qy 


764 


Db 


781 


Qy 


824 


Db 


841 


Qy 


884 


Db 


901 


Qy 


944 



IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 18 0 
I I I I M I II I I I I I I I I | I I I I M | | || | M | | | | | | | | | | | | | | || | | | | M I I I I II I 
I KVLLKKNDDALFVCGTNAFNP S CRNYKMDTLEP FGDEFS GMARC P YDAKHANVALFADG 180 

KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 
M I M I M I M I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 
I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I | | | | | | 
DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 
I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | || 
VPKPRPGCCAGS S SLERYATSNEFPDDTLNFI KTHPLMDEAVPS I FNRPWFLRTMVRYRL 420 

TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 4 80 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
TKIAVDTAAGPYQNHTWFLGSEKGIILKFL7VRIGNSGFLNDSLFLEEMSVYNSEKCSYD 4 80 

GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 54 0 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 54 0 

ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALN GHSSSLL 583 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNDISTPLPDNEMSYNTVYGHSSSLL 600 

PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 643 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

PSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHD 660 

QLVPVTLLAI AVI LAFVMGAVFS GI T VYCVCDHRRKDVAWQRKEKELTHS RRGSMS S VT 7 03 
I I I I I I I I I I I I I I I I I I I | | | | | | | I I I I I I I I I I I I I I I I I | | | | | | | | | I I I I I I I I 
QLVPVTLLAI AVI LAFVMGAVFS GITVYCVCDHRRKDVAWQRKEKELTHS RRGSMS SVT 720 

KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 7 63 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
KLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 7 8 0 

LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQ 823 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
LQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQ 84 0 

GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 8 83 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQ 900 

REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 94 3 
I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
REASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSN 960 

944 S S HL S RNQS FGRGDN P P PAPQRVD S I QVH S S Q P S GQAVT VS RQ P S LNAYN S LTRS GLKRT 1003 



Db 

Qy 
Db 



I I N I I M I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | M ! I I I I I 

961 SSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRT 1020 

1004 PSLKPDVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 PSLKPDVPPKPSFAPLSTSMKPNDACT 1047 



RESULT 7 

US-10-403-676-28 

Sequence 28, Application US/10403676 
Publication No. US20040029150A1 
GENERAL INFORMATION: 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF INVENTION 
AND METHODS OF USE 

FILE REFERENCE: 21402-573B 

CURRENT APPLICATION NUMBER: US/10/4 03,676 
CURRENT FILING DATE: 2003-03-31 
PRIOR APPLICATION NUMBER: 60/123,667 
PRIOR FILING DATE: 1999-03-09 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-08 
PRIOR APPLICATION NUMBER: 09/957,187 
PRIOR FILING DATE: 2001-09-19 
PRIOR APPLICATION NUMBER: 60/371,002 
PRIOR FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: 60/127,352 
PRIOR FILING DATE: 1999-04-01 



Alsobrook II, John 
Anderson, David W. 
Boldog, Ferenc L. 
Burgess, Catherine E* 
Casman, Stacie J. 
Edinger, Shlomit R. 
Gerlach, Valerie L. 
Grosse, William M. 
Guo, Xiaojia 
Gusev, Vladimir Y. 
Ji, Weizhen 
LaRochelle, William J. 
Lepley, Denise M. 
Li, Li 

Liu, Xiaohong 
MacDougall, John R. 
Malyankar, Uriel M. 
Millet, Isabelle 
Padigaru, Muralidhara 
Patturajan, Meera 
Peyman, John A. 
Rastelli, Luca 
Reiger, Daniel 
Rothenberg, Mark E. 
Shimkets, Richard A. 
Stone, David J. 
Taupier, Raymond J. 
Vernet, Corine 
Zerhusen, Bryan D. 

THERAPEUTIC POLYPEPTIDES, NUCLEIC ACIDS ENCODING SAME, 



; PRIOR APPLICATION NUMBER: 09/538,092 

; PRIOR FILING DATE: 2000-03-29 

; PRIOR APPLICATION NUMBER: 09/604,286 

PRIOR FILING DATE: 2000-06-22 

PRIOR APPLICATION NUMBER: 60/140,584 
; PRIOR FILING DATE: 1999-06-23 
; PRIOR APPLICATION NUMBER: 60/370,381 

PRIOR FILING DATE: 2002-04-05 

PRIOR APPLICATION NUMBER: 60/384,297 

PRIOR FILING DATE: 2002-05-30 

Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS : 17 9 

SOFTWARE: CuraSeqList version 0.1 
; SEQ ID NO 28 

LENGTH: 1018 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-403-676-28 

Query Match 98.5%; Score 5368; DB 12; Length 1018; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1013; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

QY 17 GAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 76 

I : I I M M I I I I I I I I I I I I || I I | | | | | | | | | M I I I I I I II I I I I I I i I I I I I I I I I I 
Db 2 GSGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 61 

QY 77 YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDD7VLFVCG 136 

N I I I I I I I II I I II I II I II I I I I I I I I I I I I I I I | | | | | | | | | | II I I I I I I I 

Db 62 YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 121 

Qy 137 TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAV 196 

I I I I I I M I II I I I I I I I I M I I || | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | || 
Db 122 TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKH7\NVALFADGKLYSATVTDFLAIDAV 181 

QY 197 IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 256 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I | | | | | | I I I I I I I I M I 
Db 182 IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 241 

QY 257 VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFS 316 

M I I I I I II I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I M I I | | | | || | | | | | | | | 
Db 242 VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFS 301 

QY 317 TPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLE 376 

I I I I I I I I I I I I I I I I I I I | | | | | I I I I I I I I I I I I I I I M | | | | 

Db 302 TPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLE 361 

QY 377 RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 436 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | || 
Db 362 RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 421 

Qy 437 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 496 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I 
Db 422 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 481 

QY 497 SSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQD 556 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I II I I I II I I I I I I II I I I I I I I I I I I 



Db 



4 82 SSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQD 541 



QY 557 IERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDST 616 

I M I I I I I I I I I I I I I I I I I I | | | | | | | | | | M | | | M I I I I I I I I I I I I I I I I I I I I I I 
Db 542 IERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDST 601 

Qy 617 DPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFVMGAVFSGITVYCVCDH 676 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 602 DPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFVMGAVFSGITVYCVCDH 661 

QY 677 RRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPG 736 

I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I 
Db 662 RRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPG 721 

QY 737 NTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGS 796 

I I I I I I I M I I I I I II I I I I I I I I I I | | | | | | | M | | | | | | | | | | | | | | | | | | || | | | | | 
Db 722 NTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGS 781 

Qy 797 PVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIK 856 

I I I I I I I II I M I I I I I I I M I I I I I I II I I I I II I I I I I I I II I I I I I I I I I I I I I I | | 
Db 782 PVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIK 841 

Qy 857 EHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDY 916 

I I I I M I I I M I I I I I I M I I I I II I M I I I I I I I I I | | | || || | | | | | M M | | | | | | | 
Db 842 EHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDY 901 

Qy 917 KRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQP 976 

M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II I I I I 
Db 902 KRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQP 961 

Qy 977 SGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 1030 

M I I I I I I I I I I I I II I I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 962 SGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 1015 



RESULT 8 

US-10-449-548-28 

Sequence 28, Application US/10449548 
Publication No. US20040018977A1 
GENERAL INFORMATION: 
APPLICANT: Alvarez, Enrique 
APPLICANT: Anderson, David W. 
APPLICANT: Dhanabal, Mohanraj 
APPLICANT: Khramtsov, Nikolai V. 
APPLICANT: LaRochelle, William J. 
APPLICANT: Li, Li 
APPLICANT: Lichenstein, Henri 
APPLICANT: Ooi, Chean Enq 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Shimkets, Richard A. 
APPLICANT: Zhong, Mei 

TITLE OF INVENTION: SEMAPHORIN-LIKE PROTEINS AND METHODS OF USING SAME 
FILE REFERENCE: 15966-54 0CI P2 
CURRENT APPLICATION NUMBER: US/10/449, 548 
CURRENT FILING DATE: 2003-05-30 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-03 



; PRIOR APPLICATION NUMBER: 60/123,667 

; PRIOR FILING DATE: 1999-03-09 

; PRIOR APPLICATION NUMBER: 60/234,082 

; PRIOR FILING DATE: 2000-09-20 

; PRIOR APPLICATION NUMBER: 60/233,798 

; PRIOR FILING DATE: 2000-09-19 

; PRIOR APPLICATION NUMBER: 60/174,485 

; PRIOR FILING DATE: 2000-01-04 

; PRIOR APPLICATION NUMBER: 10/403,676 

; PRIOR FILING DATE: 2003-03-31 

; PRIOR APPLICATION NUMBER: 60/371,002 

; PRIOR FILING DATE: 2002-04-09 

; PRIOR APPLICATION NUMBER: 60/384,798 

; PRIOR FILING DATE: 2002-05-30 

; PRIOR APPLICATION NUMBER: 60/402,407 

; PRIOR FILING DATE: 2002-08-09 

; PRIOR APPLICATION NUMBER: 60/443,062 

; PRIOR FILING DATE: 2003-01-28 

; NUMBER OF SEQ ID NOS : 58 

SOFTWARE : CuraSeqList version 0.1 
; SEQ ID NO 28 

LENGTH: 1018 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-449-548-28 



Query Match 98.5%; Score 5368; DB 15; Length 1018; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1013; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 17 GAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 7 6 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I 
Db 2 GSGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 61 



QY 77 YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 136 

I M I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I | | | | | I I I I I I I I I I I 
Db 62 YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 121 

Qy 137 TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAV 196 

I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I || I I 
Db 122 TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANV7VLFADGKLYSATVTDFLAIDAV 181 



Qy 197 IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 256 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 182 IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 241 

Qy 257 VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFS 316 

I M I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I | | | | | | | | || || | | | | | | | | | | | | | 
Db 242 VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFS 301 

Qy 317 TP YNS I PGSAVCAYDMLDI ASVFTGRFKEQKS PDSTWTPVPDERVPKPRPGCCAGS S SLE 376 

I M I I I I I I I I I I I I I I I I I I I I M I I I I | | I I I I | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 302 TPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLE 361 

Qy 377 RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMWYRLTKIAVDTAAGPYQNHT 436 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 362 RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 421 



Qy 437 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 496 

I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 422 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 481 

QY 4 97 SSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQD 556 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I II I I I I I I I I I I I I I I I I I 
Db 4 82 SSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQD 541 

Qy 557 IERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDST 616 

M M M I I I I I I I I I I I I I I I I I I I I I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 542 IERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDST 601 

Qy 617 DPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFVMGAVFSGITVYCVCDH 676 

I I M I I I I I I I I I I I I I I I I I I I I || I I I | | | | | | | | | || | | | | | | | | | | | | | | | | | | | | 
Db 602 DPLGAVS SHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFVMGAVFSGITVYCVCDH 661 

Qy 677 RRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPG 736 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I 
Db 662 RRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPG 721 

Qy 737 NTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGS 796 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 722 NTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGS 7 81 

Qy 797 PVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIK 856 

M I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 782 PVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIK 841 

Qy 857 EHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDY 916 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I 
Db 842 EHLSSKSPNHGWLVENLDSLPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDY 901 

Qy 917 KRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQP 976 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 902 KRSYPTNSLTRSHQATTLKRNNTNSSMSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQP 961 

Qy 977 S GQAVT VS RQ P S LNAYN S LT RS GLKRT P S LKP DVP P KP S FAP L ST SMKPNDACT 1030 

I I I I I I I I I I I I I I I I I I I I I I | | | | | | I I I I I I I I I M I I I | | | | | | | | | | | | 
Db 962 SGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 1015 



RESULT 9 

US-10-403-676-18 

; Sequence 18, Application US/10403676 
; Publication No. US20040029150A1 
; GENERAL INFORMATION: 



APPLICANT 


* Alsobrook 


II, John 


APPLICANT 


Anderson 


, David W. 


APPLICANT 


Boldog, 


Ferenc L. 


APPLICANT 


Burgess , 


Catherine E 


APPLICANT 


Casman, 


Stacie J. 


APPLICANT 


Edinger , 


Shi omit R. 


APPLICANT 


Gerlach, 


Valerie L. 


APPLICANT 


Grosse, 


William M. 


APPLICANT 


Guo, Xia 


ojia 


APPLICANT. 


Gusev, Vladimir Y. 



Ji, Weizhen 

LaRochelle, William J. 
Lepley, Denise M. 
Li, Li 

Liu, Xiaohong 
MacDougall, John R. 
Malyankar, Uriel M. 
Millet, Isabelle 
Padigaru, Muralidhara 
Patturajan, Meera 
Peyraan, John A. 
Rastelli, Luca 
Reiger, Daniel 
Rothenberg, Mark E. 
Shimkets, Richard A. 
Stone, David J. 
Taupier, Raymond J. 
Vernet, Corine 
Zerhusen, Bryan D. 

THERAPEUTIC POLYPEPTIDES, NUCLEIC ACIDS ENCODING SAME, 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF INVENTION 
AND METHODS OF USE 

FILE REFERENCE: 21402-573B 

CURRENT APPLICATION NUMBER: US/10/403, 676 
CURRENT FILING DATE: 2003-03-31 
PRIOR APPLICATION NUMBER: 60/123,667 
PRIOR FILING DATE: 1999-03-09 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-08 
PRIOR APPLICATION NUMBER: 09/957,187 
PRIOR FILING DATE: 2001-09-19 
PRIOR APPLICATION NUMBER: 60/371,002 
PRIOR FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: 60/127,352 
PRIOR FILING DATE: 1999-04-01 
PRIOR APPLICATION NUMBER: 09/538,092 
PRIOR FILING DATE: 2000-03-29 
PRIOR APPLICATION NUMBER: 09/604,286 
PRIOR FILING DATE: 2000-06-22 
PRIOR APPLICATION NUMBER: 60/140,584 
PRIOR FILING DATE: 1999-06-23 
PRIOR APPLICATION NUMBER: 60/370,381 
PRIOR FILING DATE: 2002-04-05 
PRIOR APPLICATION NUMBER: 60/384,297 
PRIOR FILING DATE: 2002-05-30 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 179 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 18 
LENGTH: 1035 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-403-676-18 



Query Match 98.2%; Score 5349.5; DB 12; Length 1035; 

Best Local Similarity 98.3%; Pred. No. 0; 

Matches 1013; Conservative 1; Mismatches 0; Indels 17; Gaps 1; 



17 GAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 76 
I : I ' I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
2 GSGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 61 

77 YTVDI DT SHTEEI YCS KKLTWKS RQADVDTCRMKGKHKDECHN FI KVLLKKNDDALFVCG 136 

I I I I N I I I I I I I I I I I I I I I I | | | | | | | | | | | | M | | | | | | | | | | | || | | | | | | | | | | | 
62 YTVDI DT SHTEEI YCS KKLTWKS RQADVDTCRMKGKHKDECHNFI KVLLKKNDDALFVCG 121 

137 TMAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAIDAV 196 

I M I I I I I I I I I II I I I I I I | | | | | | | | | | | | | | | || | | | | | | | | | | | | || | | | | | | | | | 
122 TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYSATVTDFLAI DAV 181 

197 IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 256 

I I M I I I I I II I I I I I I I I I | | | | | | | | | | | | | | | | | | || M I I I I M I I I I I I I 

182 IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 241 

257 VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFS 316 

I I I I I M I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
242 VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDWLATFS 301 

317 TPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLE 376 

I M I I M I I I I I I II I I I I I I I I | | | | | | | | || | | | | | | | | | | | | | | | | | | | | | | | | | | | 
302 TP YNS I PGSAVCAYDMLDIASVFTGRFKEQKS PDSTWTPVPDERVPKPRPGCCAGS SS LE 361 

377 RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 4 36 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I | | | | | I I I I I I I I I I I I I I I I I | | | | | | | M 
362 RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 421 

437 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 496 

I I I I I M I I I I I I I I I I I I I I I I I I I | | | | | | | | || | | || | || | | | || | | || | | | | | | | | 
422 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 481 

497 S SLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDP YCGWI KEGGACSHLS PNS RLTFEQD 556 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I I I II I I I I 
482 S SLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWI KEGGACSHLS PNS RLTFEQD 541 

557 IERGNTDGLGDCHNSFVALN GHSSSLLPSTTTSDSTAQEGYES 599 

I M I I I II I I I I I II I I I I I I I I I II II Ml I I I I I I I 

542 IERGNTDGLGDCHNSFVALNDISTPLPDNEMSYNTVYGHSSSLLPSTTTSDSTAQEGYES 601 

600 RGGMLDWKHLLDS PDSTDPLGAVS SHNHQDKKGVI RES YLKGHDQLVPVTLLAIAVI LAF 659 

I I I I I I I M I I I I I I I I I I I I I I I I I I I II II I I I II I I I I I I II I I I I I I || I I I || | | 
602 RGGMLDWKHLLDS PDSTDPLGAVS SHNHQDKKGVI RES YLKGHDQLVPVTLLAIAVI LAF 661 

660 VMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKP 719 

I I I I I M I I I I M I I I I I I | || | | | | | || | | | | | | | II | | | | | | || | | | || | | | | | | | | | 
662 VMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKP 721 

720 EAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWER 779 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I | | | | | | | | 
722 EAI LTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWER 781 

780 NQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVA 839 

I I M I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I | I || | | | | | | | || | | || | | 
782 NQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVA 841 

840 QMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTG 899 



Db 842 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | 1 1 1 I II 1 1 1 1 I I I 


QMALEDQAATLEYKTI KEHLS SKS PNHGVNLVENLDSLPPKVPQREAS LGP PGASLSQTG 901 


Qy 900 


LSKRLEMHHSSSYGVDYKRSYPTNSLTRSHOATTT.KRWMTTvrq^M^^HT qRMn^FrRrnMP qrq 




1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 


Db 902 


LSKRLEMHHSS S YGVDYKRSYPTNSLTRSHQATTLKRNNTNS SNS SHLSRNQS FGRGDNP 961 


Qy 960 


PPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPL 1019 




N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | | | | | | | | | | | | | | M M 1 1 1 1 1 1 


Db 962 


ffAf^KVUbiy VttbbyFbtjy/\Vl VbRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPL 1021 




0 1 oi v iJ\ r IN JJ/\L- 1 X U o U 

1 1 1 1 1 1 1 1 1 1 1 


Db 1022 


1 1 1 1 1 It 1 1 1 1 

ST SMKPTJDAPT 1 0^9 


RESULT 10 




US-10-449-548- 


-18 


; Sequence 18, 


Application US/10449548 


; Publication 


No. US20040018977A1 


; GENERAL INFORMATION: 


; APPLICANT: 


Alvarez, Enrique 


; APPLICANT: 


Anderson, David W. 


; APPLICANT: 


Dhanabal , Mohanra j 


; APPLICANT: 


Khramtsov, Nikolai V. 


; APPLICANT: 


LaRochelle, William J. 


; APPLICANT: 


Li, Li 


; APPLICANT: 


Lichenstein, Henri 


; APPLICANT: 


Ooi, Chean Eng 


; APPLICANT: 


Padigaru, Muralidhara 


; APPLICANT: 


Shimkets, Richard A. 


; APPLICANT: 


Zhong, Mei 



TITLE OF INVENTION: SEMAPHORIN-LIKE PROTEINS AND METHODS OF USING SAME 

FILE REFERENCE: 15966-540CIP2 

CURRENT APPLICATION NUMBER: US/10/449,548 

CURRENT FILING DATE: 2003-05-30 

PRIOR APPLICATION NUMBER: 09/520,781 

PRIOR FILING DATE: 2000-03-03 

PRIOR APPLICATION NUMBER: 60/123,667 

PRIOR FILING DATE: 1999-03-09 

PRIOR APPLICATION NUMBER: 60/234,082 

PRIOR FILING DATE: 2000-09-20 

PRIOR APPLICATION NUMBER: 60/233,798 

PRIOR FILING DATE: 2000-09-19 

PRIOR APPLICATION NUMBER: 60/174,485 

PRIOR FILING DATE: 2000-01-04 

PRIOR APPLICATION NUMBER: 10/403,676 

PRIOR FILING DATE: 2003-03-31 

PRIOR APPLICATION NUMBER: 60/371,002 

PRIOR FILING DATE: 2002-04-09 

PRIOR APPLICATION NUMBER: 60/384,798 

PRIOR FILING DATE: 2002-05-30 

PRIOR APPLICATION NUMBER: 60/402,407 

PRIOR FILING DATE: 2002-08-09 

PRIOR APPLICATION NUMBER: 60/443,062 

PRIOR FILING DATE: 2003-01-28 

NUMBER OF SEQ ID NOS : 58 



SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 18 
LENGTH: 1035 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-449-548-18 

Query Match 98.2%; Score 5349.5; DB 15; Length 1035; 

Best Local Similarity 98.3%; Pred. No. 0; 

Matches 1013; Conservative 1; Mismatches 0; Indels 17; Gaps 1; 

Qy 17 GAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 76 

1 = 111111111 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 GSGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMNGTLYIAARDHI 61 

Qy 77 YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 136 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 62 YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 121 

Qy 137 TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAJ^HANVALFADGKLYSATVTDFLAIDAV 196 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 122 TNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVT^FADGKLYSATVTDFLAIDAV 181 

Qy 197 IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 256 

I I I I I I M I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I || I I I I 
Db 182 IYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVAQ 241 

Qy 257 VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVI RINGRDWLATFS 316 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 242 VCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIRINGRDVVLATFS 301 

Qy 317 TPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLE 376 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 302 TPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLE 361 

Qy 377 RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 436 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 362 RYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHT 421 

Qy 437 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 496 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 422 WFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRAS 481 

Qy 497 SSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQD 556 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 4 82 SSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQD 541 

Qy 557 IERGNTDGLGDCHNSFVALN GHSSSLLPSTTTSDSTAQEGYES 599 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 542 IERGNTDGLGDCHNSFVALNDISTPLPDNEMSYNTVYGHSSSLLPSTTTSDSTAQEGYES 601 

Qy 600 RGGMLDWKHLLDS PDSTDPLGAVS SHNHQDKKGVI RES YLKGHDQLVPVTLLAI AVI LAF 659 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 602 RGGMLDWKHLLDS PDSTDPLGAVS SHNHQDKKGVI RES YLKGHDQLVPVTLLAI AVI LAF 661 



Qy 



660 VMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKP 719 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



662 



VMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKP 721 



Qy 720 EAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWER 779 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | I | | | M | | | | | | | | | | I I I I I I I 
Db 722 EAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWER 7 81 

Qy 780 NQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVA 839 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 782 NQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVA 841 

Qy 840 QMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTG 899 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 842 QMALEDQAATLEYKTIKEHLSSKSPNHGWLVENLDSLPPKVPQREASLGPPGASLSQTG 901 

Qy 900 LSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNP 959 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I 
Db 902 LSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNP 961 

Qy 960 P PAPQRVDS I QVHS SQ P S GQAVTVS RQP S LNAYN S LT RS GL KRT P S LKP DVPP KP S FAP L 1019 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 962 P PAPQRVDS IQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPS FAP L 1021 

Qy 1020 STSMKPNDACT 1030 

I I I I I I I I I I I 
Db 1022 STSMKPNDACT 1032 



RESULT 11 
US-10-403-676-20 

Sequence 20, Application US/10403676 
Publication No. US20040029150A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Alsobrook II, John 
Anderson, David W. 
Boldog, Ferenc L. 
Burgess, Catherine E. 
Casman, Stacie J. 
Edinger, Shlomit R. 
Gerlach, Valerie L. 
Grosse, William M. 
Guo, Xiaojia 
Gusev, Vladimir Y. 
Ji, Weizhen 
LaRochelle, William J. 
Lepley, Denise M. 
Li, Li 

Liu, Xiaohong 
MacDougall, John R. 
Malyankar, Uriel M. 
Millet, Isabelle 
Padigaru, Muralidhara 
Patturajan, Meera 
Peyman, John A. 
Rastelli, Luca 
Reiger, Daniel 
Rothenberg, Mark E. 
Shimkets, Richard A. 



APPLICANT : Stone, David J. 
APPLICANT: Taupier, Raymond J. 
APPLICANT: Vernet, Corine 
APPLICANT: Zerhusen, Bryan D. 

TITLE OF INVENTION: THERAPEUTIC POLYPEPTIDES, NUCLEIC ACIDS ENCODING SAME, 
AND METHODS OF USE 

FILE REFERENCE: 21402-573B 

CURRENT APPLICATION NUMBER: US/10/403,676 
CURRENT FILING DATE: 2003-03-31 
PRIOR APPLICATION NUMBER: 60/123,667 
PRIOR FILING DATE: 1999-03-09 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-08 
PRIOR APPLICATION NUMBER: 09/957,187 
PRIOR FILING DATE: 2001-09-19 
PRIOR APPLICATION NUMBER: 60/371,002 
PRIOR FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: 60/127,352 
PRIOR FILING DATE: 1999-04-01 
PRIOR APPLICATION NUMBER: 09/538,092 
PRIOR FILING DATE: 2000-03-29 
PRIOR APPLICATION NUMBER: 09/604,286 
PRIOR FILING DATE: 2000-06-22 
PRIOR APPLICATION NUMBER: 60/140,584 
PRIOR FILING DATE: 1999-06-23 
PRIOR APPLICATION NUMBER: 60/370,381 
PRIOR FILING DATE: 2002-04-05 
PRIOR APPLICATION NUMBER: 60/384,297 
PRIOR FILING DATE: 2002-05-30 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 17 9 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 20 
LENGTH: 998 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-403-676-20 

Query Match 94.2%; Score 5134.5; DB 12; Length 998; 

Best Local Similarity 98.3%; Pred. No. 0; 

Matches 975; Conservative 0; Mismatches 0; Indels 17; Gaps 1; 

Qy 56 LDIQMIMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKD 115 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I 
Db 4 LDIQMIMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKD 63 

Qy 116 ECHNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVA 175 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 64 ECHNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVA 123 

Qy 176 LFADGKLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFF 235 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 124 LFADGKLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFF 183 

Qy 236 FREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNI 295 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I 
Db 184 FREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNI 243 



Qy 296 LQAVTDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTP 355 

M I I I I M I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 244 LQAVTDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTP 303 

Qy 356 VPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTM 415 

I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I | | | | | | | M | | | | | | | | | | | | | | I I I 
Db 304 VPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTM 363 

Qy 416 VRYRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSE 475 

I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 364 VRYRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSE 423 

Qy 47 6 KCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGW 535 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 424 KCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGW 483 

Qy 536 I KEGGACSHLS PNSRLTFEQDIERGNTDGLGDCHNS FVALN GH 578 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I || 
Db 484 IKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNDISTPLPDNEMSYNTVYGH 543 

Qy 579 SSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESY 638 

I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I || I I I I I || 
Db 544 SSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESY 603 

Qy 639 LKGHDQLVPVTLLAIAVT LAFVMGAVFSGITVYCVCDHRRKDVAVVQRKEKELTHSRRGS 698 

I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I 
Db 604 LKGHDQLVPVTLLAIAVT IAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGS 663 

Qy 699 MSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTP 758 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I 
Db 664 MSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTP 723 

Qy 759 ESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWL 818 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II 
Db 724 ESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWL 783 

Qy 819 PITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLP 878 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 784 PITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLP 843 

Qy 879 PKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNN 938 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 844 PKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNN 903 

Qy 939 TNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRS 998 

I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 904 TNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRS 963 

Qy 999 GLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 964 GLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 995 



RESULT 12 
US-10-449-548-20 

; Sequence 20, Application US/10449548 



Publication No. US2004 0018977A1 
GENERAL INFORMATION: 
APPLICANT: Alvarez, Enrique 
APPLICANT: Anderson, David W. 
APPLICANT: Dhanabal, Mohanraj 
APPLICANT: Khramtsov, Nikolai V. 
APPLICANT: LaRochelle, William J. 
APPLICANT: Li, Li 
APPLICANT: Lichenstein, Henri 
APPLICANT: Ooi, Chean Eng 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Shimkets, Richard A. 
APPLICANT: Zhong, Mei 

TITLE OF INVENTION: SEMAPHORIN-LIKE PROTEINS AND METHODS OF USING SAME 
FILE REFERENCE: 15966-540CIP2 
CURRENT APPLICATION NUMBER: US/10/449,548 
CURRENT FILING DATE: 2003-05-30 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-03 
PRIOR APPLICATION NUMBER: 60/123,667 
PRIOR FILING DATE: 1999-03-09 
PRIOR APPLICATION NUMBER: 60/234,082 
PRIOR FILING DATE: 2000-09-20 
PRIOR APPLICATION NUMBER: 60/233,798 
PRIOR FILING DATE: 2000-09-19 
PRIOR APPLICATION NUMBER: 60/174,485 
PRIOR FILING DATE: 2000-01-04 
PRIOR APPLICATION NUMBER: 10/403,676 
PRIOR FILING DATE: 2003-03-31 
PRIOR APPLICATION NUMBER: 60/371,002 
PRIOR FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: 60/384,798 
PRIOR FILING DATE: 2002-05-30 
PRIOR APPLICATION NUMBER: 60/402,407 
PRIOR FILING DATE: 2002-08-09 
PRIOR APPLICATION NUMBER: 60/443,062 
PRIOR FILING DATE: 2003-01-28 
NUMBER OF SEQ ID NOS : 58 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 20 
LENGTH: 998 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-449-548-20 

Query Match 94.2%; Score 5134.5; DB 15; Length 998; 

Best Local Similarity 98.3%; Pred. No. 0; 

Matches 975; Conservative 0; Mismatches 0; Indels 17; Gaps 1; 

Qy 56 LDIQMIMIMNGTLYIAARDHIYTVDIDTSHTEEI YCSKKLTWKSRQADVDTCRMKGKHKD 115 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 LDIQMIMIMNGTLYIAARDHIYTVDIDTSHTEEI YCSKKLTWKSRQADVDTCRMKGKHKD 63 

Qy 116 ECHNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVA 175 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 64 ECHNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVA 123 



176 LFADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFF 235 

I N I I I II I I I I I I I I I I I M | | | | | | | | | | | | | M I I I I II II I I I I I I I I II I II I I I 
124 LFADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFF 183 

236 FREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNI 295 

I N I I I N I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I 
184 FREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNI 243 

296 LQAVTDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTP 355 

N I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
244 LQAVTDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTP 303 

356 VPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTM 415 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | M I I I I I I I I I I I I I I I I I I I I 
304 VPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTM 363 

416 VRYRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSE 475 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I | | 
364 VRYRLTKI AVDTAAGP YQMHT WFLGS EKGI I LKFLARI GNS GFLNDS LFLEEMS VYNS E 423 

476 KCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGW 535 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

424 KCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGW 4 83 

536 I KEGGACSHLS PNSRLTFEQDI ERGNTDGLGDCHNS FVALN GH 578 

I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I | | 

484 I KEGGACSHLS PNSRLTFEQDI ERGNTDGLGDCHNS FVALNDISTPLPDNEMSYNTVYGH 543 

579 SSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESY 638 

I M I I I II I I I I I I I I I I I I I I | | | | | | || | | | | | | || | | | | | | | | | | | | | | || | | | | | | 
544 SSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESY 603 

639 LKGHDQLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGS 698 

I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I | II I I I I I I I I I I I I I I I I I 

604 LKGHDQLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGS 663 

699 MSSWKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTP 758 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I I I I | | | | | M I I 
664 MSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTP 723 

759 ESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWL 818 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
724 ESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWL 783 

819 PITQQGYQHEYVDQPKMS EVAQMALEDQAATLEYKTIKEHLSSKSPNHGWLVENLDSLP 87 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II | | | | | | | | | M | | | | 
784 PITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLP 843 

879 PKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNN 938 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
844 PKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNN 903 

939 TNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQYHSSQPSGQAVTVSRQPSLNAYNSLTRS 998 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

904 TNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRS 963 

999 GLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 1030 



Db 964 GLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 995 



RESULT 13 
US-10-403-676-30 

Sequence 30, Application US/10403676 

Publication No. US20040029150A1 
GENERAL INFORMATION: 

APPLICANT: Alsobrook II, John 

APPLICANT: Anderson, David W. 

APPLICANT: Boldog, Ferenc L. 

APPLICANT: Burgess, Catherine E. 

APPLICANT: Casman, Stacie J. 

APPLICANT: Edinger, Shlomit R. 

APPLICANT: Gerlach, Valerie L. 

APPLICANT: Grosse, William M- 

APPLICANT: Guo, Xiaojia 

APPLICANT: Gusev, Vladimir Y. 

APPLICANT: Ji, Weizhen 

APPLICANT: LaRochelle, William J. 

APPLICANT: Lepley, Denise M. 

APPLICANT: Li, Li 

APPLICANT: Liu, Xiaohong 

APPLICANT: MacDougall, John R. 

APPLICANT: Malyankar, Uriel M. 

APPLICANT: Millet, Isabelle 

APPLICANT: Padigaru, Muralidhara 

APPLICANT: Patturajan, Meera 

APPLICANT: Peyman, John A. 

APPLICANT: Rastelli, Luca 

APPLICANT: Reiger, Daniel 

APPLICANT: Rothenberg, Mark E. 

APPLICANT: Shimkets, Richard A. 

APPLICANT: Stone, David J. 

APPLICANT: Taupier, Raymond J. 

APPLICANT: Vernet, Corine 



Zerhusen, Bryan D. 

THERAPEUTIC POLYPEPTIDES, NUCLEIC ACIDS ENCODING SAME, 



APPLICANT: 
TITLE OF INVENTION 
AND METHODS OF USE 

FILE REFERENCE: 214 02-573B 

CURRENT APPLICATION NUMBER: US/10/403, 676 
CURRENT FILING DATE: 2003-03-31 
PRIOR APPLICATION NUMBER: 60/123,667 
PRIOR FILING DATE: 1999-03-09 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-08 
PRIOR APPLICATION NUMBER: 09/957,187 
PRIOR FILING DATE: 2001-09-19 
PRIOR APPLICATION NUMBER: 60/371,002 
PRIOR FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: 60/127,352 
PRIOR FILING DATE: 1999-04-01 
PRIOR APPLICATION NUMBER: 09/538,092 
PRIOR FILING DATE: 2000-03-29 
PRIOR APPLICATION NUMBER: 09/604,286 
PRIOR FILING DATE: 2000-06-22 



; PRIOR APPLICATION NUMBER: 60/140,584 

; PRIOR FILING DATE: 1999-06-23 

; PRIOR APPLICATION NUMBER: 60/370,381 

PRIOR FILING DATE: 2002-04-05 
; PRIOR APPLICATION NUMBER: 60/384,297 
; PRIOR FILING DATE: 2002-05-30 

; Remaining Prior Application data removed - See File Wrapper or PALM. 

; NUMBER OF SEQ ID NOS : 179 

; SOFTWARE: CuraSeqList version 0.1 

; SEQ ID NO 30 

LENGTH: 981 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-403-676-30 



Query Match 93.8%; Score 5113.5; DB 12; Length 981; 

Best Local Similarity 94.6%; Pred. No. 0; 

Matches 974; Conservative 0; Mismatches 1; Indels 55; Gaps 1; 

QY 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I i I I I M M I I I I I I I II I I I I I I I I I | | | | | | | | | | | | M I I I I I I I I I I I I I I I I I I 
Db 4 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 63 

QY 61 IMIMNGTLYIT^ARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 12 0 

I M I I I I M I I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I | | | | | | | | | | | | | | | | | 
Db 64 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 123 



Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGM7VRCPYDAKHANVALFADG 18 0 

M I I M I M I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I 
Db 124 I WLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 183 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I II | | | | | | | | 
Db 184 KLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 243 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 244 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 303 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | I I I I I I I I I I I I I 
Db 304 DVIRIKGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 363 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 364 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 423 

Qy 421 TKIAVDTAAGPYQNHTVVFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 424 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 4 83 



Qy 481 GVEDKRIMGMQLDRAS S S LYVAFSTCVI KVPLGRCERHGKCKKTCI ASRDP YCGWI KEGG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | I I I I I I I I I I I II I I I I I I I I I I 
Db 484 GVEDKRIMGMQLDRAS SS LYVAFSTCVI KVPLGRCERHGKCKKTC IAS RDP YCGWI KEGG 543 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I 



Db 



544 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALN 



579 



Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFV 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 580 GVIRESYLKGHDQLVPVTLLAIAVILAFV 608 

QY 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I M I I I I I M M I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 609 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 668 

Qy 721 AI LT PLMHNGKLAT PGNTAKMLI KADQHHLDLTALPT PEST PTLQQKRKP S RGS REWERN 780 

I I I M I I I I I M I I I M I I I I I I I I I I I I I I | | | | | | M I I I I I I II I I || I I || I I I I I 
Db 669 AI LT PLMHNGKLAT PGNTAKMLI KADQHHLDLTALPT PEST PTLQQKRKP S RGS REWERN 728 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I II I I I I I I I I I I I M I I I I I I I 
Db 729 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 788 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I I I I I I I I I I I I I I I I M I I I M I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 789 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 848 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

I I I I I I I I I I I M I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I II I I I I I I | | | | || 
Db 84 9 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 908 

QY 961 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I M I I 
Db 909 PAP Q RVDS I Q VH S S Q P S GQAVT VS RQ P S LNAYN S LT RS GL KRT P S L KP D VP P K P S FAP L S 968 

Qy 1021 TSMKPNDACT 1030 

I I I I I I I I I I 
Db 969 TSMKPNDACT 97 8 



RESULT 14 
US-10-449-548-30 

Sequence 30, Application US/10449548 
Publication No. US20040018977A1 
GENERAL INFORMATION: 
APPLICANT: Alvarez, Enrique 
APPLICANT: Anderson, David W. 
APPLICANT: Dhanabal, Mohanraj 
APPLICANT: Khramtsov, Nikolai V. 
APPLICANT: LaRochelle, William J. 
APPLICANT: Li, Li 
APPLICANT: Lichenstein, Henri 
APPLICANT: Ooi f Chean Eng 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Shimkets, Richard A. 
APPLICANT: Zhong, Mei 

TITLE OF INVENTION: SEMAPHORIN-LIKE PROTEINS AND METHODS OF USING SAME 
FILE REFERENCE: 15966-540CIP2 
CURRENT APPLICATION NUMBER: US/10/449,548 
CURRENT FILING DATE: 2003-05-30 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-03 



; PRIOR APPLICATION NUMBER: 60/123,667 

; PRIOR FILING DATE: 1999-03-09 

; PRIOR APPLICATION NUMBER: 60/234,082 

PRIOR FILING DATE: 2000-09-20 
; PRIOR APPLICATION NUMBER: 60/233,7 98 
; PRIOR FILING DATE: 2000-09-19 
; PRIOR APPLICATION NUMBER: 60/174,485 

PRIOR FILING DATE: 2000-01-04 
; PRIOR APPLICATION NUMBER: 10/403,676 
; PRIOR FILING DATE: 2003-03-31 
; PRIOR APPLICATION NUMBER: 60/371,002 
; PRIOR FILING DATE: 2002-04-09 
; PRIOR APPLICATION NUMBER: 60/384,798 
; PRIOR FILING DATE: 2002-05-30 
; PRIOR APPLICATION NUMBER: 60/402,4 07 
; PRIOR FILING DATE: 2002-08-09 

PRIOR APPLICATION NUMBER: 60/443,062 
; PRIOR FILING DATE: 2003-01-28 
; NUMBER OF SEQ ID NOS : 58 

SOFTWARE: CuraSeqList version 0.1 
; SEQ ID NO 30 
; LENGTH: 981 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-449-548-30 

Query Match 93.8%; Score 5113.5; DB 15; Length 981; 

Best Local Similarity 94.6%; Pred. No. 0; 

Matches 974; Conservative 0; Mismatches 1; Indels 55; Gaps 1; 
Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 




Db 



4 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 63 



Qy 



Db 



61 IMIMNGTLYIA7VRDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
64 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 123 



Qy 



121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 




Db 



124 IKVLLKKNDD7VLFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALF7VDG 183 



Qy 



Db 



181 KLYSAT VTDFLAI DAVI YRSLGES PTLRTVKHDSKWLKEP YFVQAVDYGDYI YFFFREIA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
184 KLYSATWDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEP YFVQAVDYGDYI YFFFREIA 243 



Qy 



Db 



241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
244 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 303 



Qy 



301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 




Db 



304 DVI RI KGRDWLATFSTP YNS I PGSAVCAYDMLDI ASVFTGRFKEQKS PDSTWTPVPDER 363 



Qy 

Db 



361 
364 



VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 423 



Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 424 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 483 

Qy 4 81 GVEDKRIMGMQLDRAS S SLYVAFSTCVI KVPLGRCERHGKCKKTCI ASRDP YCGWI KEGG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 484 GVEDKRIMGMQLDRAS S SLYVAFSTCVI KVPLGRCERHGKCKKTCI ASRDP YCGWI KEGG 543 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 544 ACSHLS PNSRLTFEQDI ERGNTDGLGDCHNS FVALN 579 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFV 660 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 580 GVI RESYLKGHDQLVPVTLLAIAVI LAFV 608 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 609 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 668 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 669 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 728 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 729 QNL I NACT KDMP PMGS P VI PTDL P LRAS P S H I P S VWL P I TQQG YQH E YVDQ P KMS EVAQ 788 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 789 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 848 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 849 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 908 

Qy 961 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 1020 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 909 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 968 

Qy 1021 TSMKPNDACT 1030 

I I I I I I I I I I 
Db 969 TSMKPNDACT 97 8 



RESULT 15 
US-10-403-676-46 

Sequence 46, Application US/10403676 
Publication No. US20040029150A1 
GENERAL INFORMATION: 
APPLICANT: Alsobrook II, John 
APPLICANT: Anderson, David W. 
APPLICANT: Boldog, Ferenc L. 
APPLICANT: Burgess, Catherine E. 
APPLICANT: Casman, Stacie J. 
APPLICANT: Edinger, Shlomit R. 



APPLICANT: Gerlach, Valerie L. 
APPLICANT: Grosse, William M. 
APPLICANT: Guo, Xiaojia 
APPLICANT: Gusev, Vladimir Y. 
APPLICANT: Ji, Weizhen 
APPLICANT: LaRochelle, William J. 
APPLICANT: Lepley, Denise M. 
APPLICANT: Li, Li 
APPLICANT: "Liu, Xiaohong 
APPLICANT: MacDougall, John R. 
APPLICANT: Malyankar, Uriel M. 
APPLICANT: Millet, Isabelle 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Patturajan, Meera 
APPLICANT: Peyman, John A. 
APPLICANT: Rastelli, Luca 
APPLICANT: Reiger, Daniel 
APPLICANT: Rothenberg, Mark E. 
APPLICANT: Shimkets, Richard A. 
APPLICANT : Stone, David J. 
APPLICANT: Taupier, Raymond J. 
APPLICANT: Vernet, Corine 
APPLICANT: Zerhusen, Bryan D. 

TITLE OF INVENTION: THERAPEUTIC POLYPEPTIDES, NUCLEIC ACIDS ENCODING SAME, 
AND METHODS OF USE 

FILE REFERENCE: 21402-573B 

CURRENT APPLICATION NUMBER: US/10/403, 676 
CURRENT FILING DATE: 2003-03-31 
PRIOR APPLICATION NUMBER: 60/123,667 
PRIOR FILING DATE: 1999-03-09 
PRIOR APPLICATION NUMBER: 09/520,781 
PRIOR FILING DATE: 2000-03-08 
PRIOR APPLICATION NUMBER: 09/957,187 
PRIOR FILING DATE: 2001-09-19 
PRIOR APPLICATION NUMBER: 60/371,002 
PRIOR FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: 60/127,352 
PRIOR FILING DATE: 1999-04-01 
PRIOR APPLICATION NUMBER: 09/538,092 
PRIOR FILING DATE: 2000-03-29 
PRIOR APPLICATION NUMBER: 09/604,286 
PRIOR FILING DATE: 2000-06-22 
PRIOR APPLICATION NUMBER: 60/140,584 
PRIOR FILING DATE: 1999-06-23 
PRIOR APPLICATION NUMBER: 60/370,381 
PRIOR FILING DATE: 2002-04-05 
PRIOR APPLICATION NUMBER: 60/384,297 
PRIOR FILING DATE: 2002-05-30 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 179 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 46 
LENGTH: 971 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-403-676-46 



Query Match 93.8%; Score 5109.5; DB 12; Length 971; 

Best Local Similarity 94.2%; Pred. No. 0; 

Matches 970; Conservative 1; Mismatches 0; Indels 59; Gaps 1; 

QY 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I M II I I I I I I I I I I I I I I | | | | | M I I I I I I I I ! I I I I I I I I I | | | | | | | | | | | | | | | | 
Db 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

QY ^l IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | 
Db 61 IMIMNGTLYIAARDHI YTVDI DTSHTEEI YCS KKLTWKSRQADVDTCRMKGKHKDECHNF 120 



QY 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I | | | | | | | | | | | M I 
Db 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

QY 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

N I I I I I I I I I I I I II I I I I I II I I I I I | | I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 24 0 



QY 241 VEYNTMGI^WFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I I M I I I I I I I || || | | | | | | | : | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKRWTSFLKARLNCSVPGDSHFYFNILQAVT 300 



Db 241 



QY 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I | | | | | | | | | | 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

QY 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I | | | | 
Db 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 



Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I 

CSYD 



Db 418 



Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I I I I I I I I I I I I I I I I I | | | | | | | | I I I I I I I I I I I I I I | | | | I I I I I I I I I I I I I I I I 
Db 422 GVEDKRIMGMQLDRAS S SLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 4 81 

QY 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Db 482 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 541 

QY 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVI LAFV 660 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 542 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVI LAFV 601 

QY 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 602 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 661 

QY 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 7 80 

I I M I I II I I I I I I I I I I I I I | | | | | | | | || | | | | | | | | | | | | | | || | || 

Db 662 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 721 



Qy 



781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

Db 722 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 781 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I 
Db 782 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 841 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I || I I I I I I I I 
Db 842 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 901 

Qy 961 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 902 PAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPLS 961 

Qy 1021 TSMKPNDACT 1030 

I I I I I I I I I I 
Db 962 TSMKPNDACT 971 



Search completed: March 24, 2004, 13:19:31 
Job time : 77.7731 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: March 24, 2004, 13:11:03 ; Search time 71.9691 Seconds 

(without alignments) 
4515.598 Million cell updates/sec 

Title: US-09-856-681A-2 
Perfect score: 5450 

Sequence: 1 MRSEALLLYFTLLHFAGAGF PPKPSFAPLSTSMKPNDACT 1030 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 1017041 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL 25:* 



1: 


sp archea:* 


2: 


sp bacteria:* 


3: 


sp fungi:* 


4: 


sp_human: * 


5: 


sp_invertebrate : * 


6: 


sp mammal : * 


7: 


sp__mhc: * 


8: 


sp organelle:* 


9: 


sp_phage: * 


10 


: sp_plant:* 


11 


sp rodent:* 


12 


sp_virus : * 


13 


sp_vertebrate : * 


14 


sp_unclassified: * 


15 


sp rvirus:* 


16: 


sp bacteriap:* 


17: 


sp_archeap: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
Q9EQ71 

ID Q9EQ71 PRELIMINARY 
AC Q9EQ71; 

DT 01-MAR-2001 (TrEMBLrel . 
DT 01-MAR-2001 (TrEMBLrel. 
DT 01-OCT-2003 (TrEMBLrel. 



PRT; 1005 AA. 
16, Created) 

16, Last sequence update) 
25, Last annotation update) 



DE Axon guidance signal SEMA6A1. 

GN SEMA6A. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=20564339; PubMed=109938 94 ; 

RA Klostermann A. , Lutz B., Gertler F., Behl C; 

RT "The orthologous human and murine semaphorin 6A-1 proteins (SEMA6A- 

RT l/Sema6A-l) bind to the Enabled/Vasodilator-stimulated Phosphoprotein- 

RT like Protein (EVL) via a novel carboxyl-terminal Zyxin-like domain."; 

RL J. Biol. Chem. 275:39647-39653(2000). 

DR EMBL; AF288666; AAG29494.1; 

DR MGD; MGI: 1203727; Sema6a. 

DR GO; GO: 0016021; C: integral to membrane; ISS. 

DR GO; GO: 0008580; F : cytoskeletal regulator activity; ISS. 

DR GO; GO: 0007411; P : axon guidance; ISS. 

DR GO; GO: 0007166; P:cell surface receptor linked signal transdu. . .; ISS. 

DR GO; GO: 0007399; P : neurogenesis ; ISS. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

SQ SEQUENCE 1005 AA; 111758 MW; 57B69927F45B079D CRC64; 

Query Match 92.2%; Score 5024.5; DB 11; Length 1005; 

Best Local Similarity 91.9%; Pred. No. 0; 

Matches 948; Conservative 26; Mismatches 30; Indels 27; Gaps 2; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MRPAALLLCLTLLHCAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I : I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 IMIMNRTLYVAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHJWALFADG 180 

I I I I I I I I I I I I I I I I I I I I. I I I I I I : : I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 
Db 121 IKVLLKKNDDTLFVCGTNAFNPSCRNYRVDTLETFGDEFSGMARCPYDAKHVNIALFADG 180 

Qy 181 KLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 KLYSATVTDFLAI DAVIYRSPGDSPTLRTVKHDSKWLKEPYFVQAVDYGDYI YFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLK7VRLNCSVPGDSHFYFNILQAVT 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 



QY 
Db 



301 
301 



DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIADVFTGRFKEQKSPDSTWTPVPDER 



360 
360 



Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 VPKPRPGCCAGS S S LEKYATSNEFPDDTLNFI KTHPLMDEAVPS I INRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I I : I I I II I I I I 

Db 421 T KI AVDNAAG P YQNHT W FLEEMNVYNPEKCSYD 454 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I I > I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I I I : : I I 
Db 455 GVEDKRIMGMQLDRASGSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWVRESG 514 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

• I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I : I | II I I : : : I I I I I 
Db 515 SCAHLSPLSRLTFEQDIERGNTDGLGDCHNSFVALNGHASSLYPNTTTSDSASRDGYESR 574 

QY 601 GGMLDWKHLLDS PDSTDPLGAVS SHNHQDKKGVI RES YLKGHDQLVPVTLLAIAVT LAFV 660 

I I I I I I I I : : I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I II I II I 
Db 575 GGMLDWNDLLEAPGSTDPLGAVS SHNHQDKKGVI RES YLKSNDQLVPVTLLAI AVI LAFV 634 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I :: I I I I I I I I I I I I I I I I 
Db 635 MGAVFSGIIVYCVCDHRRKDVAWRRKEKELTHSRRGSMSSVSELSGLFGDTQSKDPKPE 694 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I | | | | | | | | | | | : | | | | | | | | | 
Db 695 AI LTPLMHNGKLATS SNTAKMLI KADQHHLDLTALPTPESTPTLQQKRKPNRGSREWERN 754 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSE-VA 839 

I I : I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I || 
Db 755 QNIINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEWA 814 

Qy 84 0 QMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTG 8 99 

I I I I I II I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 815 QMALEDQAATLEYKTTKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGTSLSQTG 874 

Qy 900 LSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNP 959 

I I I I I M I I I I I I :: I I I M I I I I I I I I II II I I I I I I I I II I I I I I I I I | | | | | | | | 
Db 875 LSKRLEMQHSSSYGLEYKRSYPTNSLTRSHQTTTLKRNNTNSSNSSHLSRNQSFGRGDNP 934 

Qy 960 PPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPL 1019 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 935 PPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDVPPKPSFAPL 994 

Qy 1020 STSMKPNDACT 1030 

I I I I I I I I I I I 
Db 995 STSMKPNDACT 1005 



RESULT 2 
Q96SW4 

ID Q96SW4 PRELIMINARY; PRT; 699 AA. 

AC Q96SW4; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 
DE Hypothetical protein FLJ14595. 



OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Isogai T., Ota T., Hayashi K., Sugiyama T., Otsuki T., Suzuki Y. , 

RA Nishikawa T. f Nagai K., Sugano S., Shiratori A., Sudo H., 

RA Wagatsuma M. , Hosoiri T. , Kaku Y. r Kodaira H., Kondo H., Sugawara M. , 

RA Takahashi M. r Chiba Y. , Ishida S., Murakawa K., Ono Y., Takiguchi S., 

RA Watanabe S., Kimura K. , Murakami K. , Ishii S., Kawai Y. r Saito K. , 

RA Yamamoto J., Wakamatsu A., Nakamura Y., Nagahari K., Masuho Y., 

RA Ninomiya K. , Iwayanagi T.; 

RT "NEDO human cDNA sequencing project."; 

RL Submitted (MAY-2 001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AK027501; BAB55158.1; -. 

DR GO; GO: 0007275; P : development ; IEA. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 699 AA; 76723 MW; 2E5F111D59741394 CRC64; 

Query Match 67.5%; Score 3677; DB 4; Length 699; 

Best Local Similarity 100.0%; Pred. No. 4.5e-291; 

Matches 699; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 332 MLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNF 391 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MLDIASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNF 60 

Qy 392 IKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFL 451 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 61 I KTHPLMDEAVPS I FNRPWFLRTMVRYRLTKI AVDTAAGP YQNHTWFLGSEKGI I LKFL 120 

Qy 452 ARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVP 511 

I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 121 ARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVP 180 

Qy 512 LGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNS 571 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 LGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNS 240 

Qy 572 FVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKK 631 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 241 FVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKK 300 

Qy 632 GVIRESYLKGHDQLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKEL 691 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 301 GVI RES YLKGHDQLVPVT LLAI AVI LAFVMGAVFS GI TVYCVCDHRRKDVAWQRKEKEL 360 

Qy 692 THSRRGSMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLD 7 51 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I II I I I I I I I I I I I I I I I 
Db 361 THSRRGSMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLD 420 



QY 



752 



LTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVI PTDLPLRASPSH 811 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

Db 421 LTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSH 480 

Qy 812 I PS VWLP I TQQGYQHE YVDQPKMS EVAQMALEDQAATLE YKT I KEHLS S KS PNHGVNLV 871 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 I P S VWLP I TQQGYQHE YVDQPKMS EVAQMALEDQAAT LE YKT I KEHLS S KS PNHGVNLV 540 

Qy 872 ENLDSLPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQA 931 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 ENLDSLPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQA 600 

Qy 932 TTLKRNNTNSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNA 991 

I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I 
Db 601 TT LKRNNTN SSNSSHLS RNQS FGRGDN P P PAPQRVD S I QVH S S Q P S GQAVT VS RQ P S LNA 660 

Qy 992 YNSLTRSGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 661 YNSLTRSGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 699 



RESULT 3 
Q96SM8 

ID Q96SM8 PRELIMINARY; PRT; 574 AA. 

AC Q96SM8; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel . 24, Last annotation update) 

DE Hypothetical protein FLJ14748. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBIJTaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Isogai T., Ota T., Hayashi K., Sugiyama T., Otsuki T., Suzuki Y., 

RA Nishikawa T., Nagai K., Sugano S., Shiratori A., Sudo H., 

RA Wagatsuma M. , Hosoiri T., Kaku Y. , Kodaira H., Kondo H. , Sugawara M. , 

RA Takahashi M. , Chiba Y., Ishida S., Murakawa K., Ono Y., Takiguchi S., 

RA Watanabe S., Kimura K., Murakami K. , Ishii S., Kawai Y. , Saito K., 

RA Yamamoto J., Wakamatsu A., Nakamura Y., Nagahari K., Masuho Y., 

RA Ninomiya K., Iwayanagi T.; 

RT "NEDO human cDNA sequencing project."; 

RL Submitted (MAY-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AK027654; BAB55269.1; 

DR GO; GO: 0007275; P : development ; IEA. 

DR InterPro; IPR003659; Plexin-like. 

DR SMART; SM00423; PSI; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 574 AA; 62822 MW; 0C79E01A4117A495 CRC64 ; 

Query Match 54.7%; Score 2980.5; DB 4; Length 574; 

Best Local Similarity 90.7%; Pred. No. 2.5e-234; 

Matches 574; Conservative 0; Mismatches 0; Indels 59; Gaps 1; 

Qy 398 MDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAAGPYQNHTVVFLGSEKGIILKFLARIGNS 457 

I I I I I I I I I I I I I I I I I I I I 

Db 1 MDEAVP S I FNRPWFLRTMVR 20 



Qy 

Db 



458 
21 



GFLNDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCER 517 

I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

CSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCER 61 



Qy 518 HGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNG 577 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 62 HGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNG 121 

Qy 578 HSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RES 637 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I 
Db 122 HSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRES 181 

Qy 638 YLKGHDQLVPVTLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRG 697 

I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I II M I I I I I I I I I I I I I I I II I I II I I I 
Db 182 YLKGHDQLVPVTLLAIAVI LAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRG 241 

Qy 698 SMSSVTKLSGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPT 757 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 242 SMS SVTKLSGLFGDTQSKDPKPEAI LTPLMHNGKLATPGNTAKMLI KADQHHLDLTALPT 301 

Qy 758 PESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSWV 817 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 302 PESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSWV 361 

Qy 818 LP I TQQGYQHEYVDQPKMS EVAQMALEDQAATLEYKT I KEHLS S KS PNHGVNLVENLDS L 877 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 362 LP I TQQGYQHEYVDQPKMS EVAQMALEDQAATLEYKT I KEHLS S KS PNHGVNLVENLDS L 421 

Qy 87 8 PPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRN 937 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 422 PPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRN 4 81 

Qy 938 NTNSSNSSHLSRNQSFGRGDNPP P AP QRVD S I QVH S S Q P S GQ AVT VS RQ P S LNAYN S LT R 997 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 482 NTN SSNSSHLS RNQ S FGRGDN P P PAPQRVDS I QVH S SQ P S GQAVT VS RQP S LNAYN S LT R 541 

Qy 998 SGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 542 SGLKRTPSLKPDVPPKPSFAPLSTSMKPNDACT 574 



PRELIMINARY; 

(TrEMBLrel. 
(TrEMBLrel. 
(TrEMBLrel. 



23, 
23, 
24, 



RESULT 4 
Q8BUT0 

ID Q8BUT0 

AC Q8BUT0; 

DT 01-MAR-2003 

DT 01-MAR-2003 

DT 01-JUN-2003 

DE Sema domain. 

GN A730020P05RIK. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; 

OC Mammalia; Eutheria; 

OX NCBI_TaxID-10090; 

RN [1] 

RP SEQUENCE FROM N . A. 



PRT; 



587 AA. 



Created) 

Last sequence update) 
Last annotation update) 



Craniata; Vertebrata; Euteleostomi ; 
Rodentia; Sciurognathi; Muridae; Murinae; Mus. 



RC STRAIN-C57BL/ 6J; TISSUE=Cerebellum; 

RX MEDLINE=22354 683; PubMed=124 66851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK082711; BAC38582.1; -. 

DR MGD; MGI:2443196; A7 3002 OPOSRik . 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00630; Sema; 1. 

SQ SEQUENCE 587 AA; 66044 MW; EB9C7B102C4DB97A CRC64; 

Query Match 54.4%; Score 2963; DB 11; Length 587; 

Best Local Similarity 95.1%; Pred. No. 7e-233; 

Matches 549; Conservative 13; Mismatches 15; Indels 0; Gaps 0; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

II I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I II I I II I I I I I I I I M I I I I 
Db 1 MRPAALLLCLTLLHCAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYI7V7VRDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I : I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 61 IMIMNRTLYVAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

[ I I I I I I I I I I I I I I I I I I I I I I I I I : : I I I I M I I I I I I I I I I I I I I I I I : I I I I I I 
Db 121 IKVLLKKNDDTLFVCGTNAFNPSCRNYRVDTLETFGDEFSGMARCPYDAKHANI7VLFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I M I I I I I I I I I I I I I I I I I I I : I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 181 KLYSATVTDFLAIDAVIYRSLGDSPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLK7VRLNCSVPGDSHFYFNILQAVT 300 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIANVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 361 VPKPRPGCCAGSSSLEKYATSNEFPDDTLNFIKTHPLMDEAVPSIINRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I : I I I I I I I II I II : I II I I I I I I 
Db 421 TKIAVDNAAGPYQNHTWFLGSEKGIILKFLARIGSSGFLNGSLFLEEMNVYNPEKCSYD 4 80 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS RDPYCGWIKEGG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : I I 
Db 481 GVEDKRIMGMQLDRASGSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWVRESG 54 0 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNG 577 

: I : I I I I I I I II II I I II I I I I I I I I I I I I I I I I I I 
Db 541 SCAHLSPLSRLTFEQDIERGNTDGLGDCHNSFVALNG 577 



RESULT 5 
Q8BXZ7 



ID Q8BXZ7 PRELIMINARY; PRT; 605 AA. 

AC Q8BXZ7 ; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Sema domain . 

GN A730020P05RIK. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TI SSUE=Cerebellum; 

RX MEDLINE=22354683; PubMed=12466851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,77 0 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK042751; BAC31351.1; 

DR MGD; MGI:2443196; A730020P05Rik . 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00630; Sema; 1. 

SQ SEQUENCE 605 AA; 68299 MW; B4 FE7 13BE8AF6E90 CRC64 ; 

Query Match 54.3%; Score 2961; DB 11; Length 605; 

Best Local Similarity 94.5%; Pred. No. l.le-232; 

Matches 550; Conservative 14; Mismatches 18; Indels 0; Gaps 0; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

II I I I I I I I I I I I I I II II I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 1 MRPAALLLCLTLLHCAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 IMIMNRTLYVAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I I I I I I I I I I I I I I I I II I I I I : : I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 
Db 121 IKVLLKKNDDTLFVCGTNAFNPSCRNYRVDTLETFGDEFSGMARCPYDAKHANIALFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 KLYSATVTDFLAIDAVIYRSLGDSPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLK7VRLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I 



Db 



301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIANVFTGRFKEQKSPDSTWTPVPDER 360 



Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I 

Db 361 VPKPRPGCCAGSSSLEKYATSNEFPDDTLNFIKTHPLMDEAVPSIINRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I : I I I I I llllllhlll I I I I I I 
Db 421 TKIAVDNAAGPYQNHTWFLGSEKGIILKFLARIGSSGFLNGSLFLEEMNVYNPEKCSYD 480 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : I I 
Db 481 GVEDKRIMGMQLDRASGSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWVRESG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSL 582 

: I : I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I : I 
Db 541 SCAHLSPLSRLTFEQDIERGNTDGLGDCHNSFVALNDISTPL 582 



RESULT 6 
Q96SY4 

ID Q96SY4 PRELIMINARY; PRT; 562 AA. 

AC Q96SY4; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein FLJ14565. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Isogai T., Ota T., Hayashi K., Sugiyama T., Otsuki T., Suzuki Y., 

RA Nishikawa T., Nagai K., Sugano S., Shiratori A., Sudo H . , 

RA Wagatsuma M. , Hosoiri T., Kaku Y. , Kodaira H. , Kondo H . , Sugawara M. , 

RA Takahashi M. , Chiba Y., Ishida S., Murakawa K., Ono Y., Takiguchi S., 

RA Watanabe S., Kimura K., Murakami K., Ishii S., Kawai Y., Saito K., 

RA Yamamoto J., Wakamatsu A., Nakamura Y., Nagahari K. , Masuho Y., 

RA Ninomiya K. , Iwayanagi T.; 

RT "NEDO human cDNA sequencing project."; 

RL Submitted (MAY-2001) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; AK027471; BAB55136.1; -. 

DR GO; GO: 0007275; P : development ; IEA. 

DR InterPro; IPR003659; Plexin-like. 

DR SMART; SM00423; PSI; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 562 AA; 61313 MW; 6AB3685FAD1DD7 8A CRC64; 

Query Match 54.2%; Score 2952; DB 4; Length 562; 

Best Local Similarity 100.0%; Pred. No. 5.2e-232; 

Matches 562; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 69 MSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS 52 8 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MSVYNSEKCS YD GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS 60 



Qy 

Db 



529 
61 



RDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTT 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

RDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTT 



588 
120 



Qy 589 SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPV 648 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPV 180 

Qy 649 TLLAIAVI LAFVMGAVFSGITWCVCDHRRKDVAVVQRKEKELTHSRRGSMSSVTKLSGL 708 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 TLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGL 240 

Qy 709 FGDTQSKDPKPEAI LTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKR 768 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 FGDTQSKDPKPEAI LTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKR 300 

Qy 769 KPSRGSREWERNQNLINACTKDMPPMGSPVI PTDLPLRASPSHI PSVWLPITQQGYQHE 828 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 KPSRGSREWERNQNLINACTKDMPPMGSPVI PTDLPLRASPSHI PSVWLPITQQGYQHE 360 

Qy 829 YVDQPKMS EVAQMALEDQAATLEYKTI KEHLS SKS PNHGVNLVENLDSLP PKVPQREASL 888 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASL 420 

Qy 8 89 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 94 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 4 80 

Qy 949 RNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 100£ 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 481 RNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 540 

Qy 1009 DVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I I I I I I II I I I I I I I 
Db 541 DVPPKPSFAPLSTSMKPNDACT 562 



RESULT 7 
Q8NC49 

ID Q8NC49 PRELIMINARY; PRT; 562 AA. 

AC Q8NC4 9; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22 r Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22 f Last annotation update) 

DE Hypothetical protein FLJ90494. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Isogai T., Ota T., Nishikawa T., Hayashi K., Otsuki T., Sugiyama T., 

RA Suzuki Y., Nagai K. f Sugano S., Ishii S., Kawai-Hio Y-, Saito K., 

RA Yamamoto J., Wakamatsu A., Nakamura Y., Kojima S., Nagahari K., 

RA Masuho Y., Ono T., Okano.K., Yoshikawa Y., Aotsuka S., Sasaki N . , 

RA Hattori A., Okumura K. f Iwayanagi T., Ninomiya K. ; 

RT "NEDO human cDNA sequencing project."; 



RL Submitted (MAR-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AK074975; BAC11326.1; -. 

KW Hypothetical protein. 

SQ SEQUENCE 562 AA; 61286 MW; 708041459E34D78A CRC64; 



Query Match 54.1%; Score 2 947; DB 4; Length 562; 

Best Local Similarity 99.8%; Pred. No. 1.3e-231; 

Matches 561; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 469 MSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS 528 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I 
Db 1 MSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIAS 60 

Qy 529 RDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTT 588 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I II 
Db 61 RDPYCGWI KEGGACSHLS PNSRLT FEQDI ERGNTDGLGDCHNS FVALNGHS SSLLPSTTT 12 0 

Qy 589 SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPV 648 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I 
Db 121 SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPV 180 

Qy 649 TLLAI AVI LAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHS RRGSMS SVTKLSGL 708 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 TLLAI AVI LAFVT4GAVFSGITVYCVCDHRRKDVAVVQRKEKELTHS RRGSMS SVTKLSGL 240 

Qy 709 FGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKR 768 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 FGDTQSKDPKPEAILTPLMHNGKLATPGNTAPCMLIKADQHHLDLTALPTPESTPTLQQKR 300 

Qy 769 KPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHE 828 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 KPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHE 360 

Qy 829 YVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASL 888 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
Db 361 YVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASL 420 

Qy 889 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 948 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 4 80 

Qy 949 RNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 1008 

I II II I I I I : I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

Db 481 RNQSFGRGDSPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 540 

Qy 1009 DVPPKPSFAPLSTSMKPNDACT 1030 

I I I I I I I I I I I I I II I I I I I I I 
Db 541 DVPPKPSFAPLSTSMKPNDACT 562 



RESULT 8 
Q96T04 

ID Q96T04 PRELIMINARY; PRT; 507 AA. 

AC Q96T04; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 



DE Hypothetical protein FLJ14533. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Isogai T., Ota T., Hayashi K., Sugiyama T., Otsuki T., Suzuki Y., 

RA Nishikawa T., Nagai K. , Sugano S., Shiratori A., Sudo H., 

RA Wagatsuma M., Hosoiri T., Kaku Y. , Kodaira H., Kondo H., Sugawara M. , 

RA Takahashi M., Chiba Y. , Ishida S., Murakawa K., Ono Y., Takiguchi S., 

RA Watanabe S., Kimura K. , Murakami K., Ishii S., Kawai Y. f Saito K. , 

RA Yamamoto J., Wakamatsu A., Nakamura Y., Nagahari Masuho Y., 

RA Ninomiya K. , Iwayanagi T.; 

RT "NEDO human cDNA sequencing project."; 

RL Submitted (MAY-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AK027439; BAB55111.1; -. 

DR GO; GO: 0007275; P : development ; IEA. 

DR InterPro; IPR003659; Plexin-like. 

DR SMART; SM00423; PSI; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 507 AA; 55464 MW; 8CC567B4 38C51B39 CRC64; 



Query Match 48.1%; Score 2621.5; DB 4; Length 507; 

Best Local Similarity 90.2%; Pred. No. 4.3e-205; 

Matches 507; Conservative 0; Mismatches 0; Indels 55; Gaps 1; 

Qy 469 MSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIAS 528 

I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIAS 60 

Qy 529 RDPYCGWIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTT 588 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 61 RDPYCGWI KEGGACSHLS PNSRLTFEQDI ERGNTDGLGDCHNS FVALN 108 

Qy 589 SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPV 648 

I I II I I I I I I I I I I I I I 

Db 109 GVI RESYLKGHDQLVPV 125 

Qy 64 9 TLLAIAVI LAFVMGAVFSGITVTCVCDHRRKDVAVVQRKEKELTHSRRGSMSSvTKLSGL 7 08 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 126 TLLAIAVI LAFVMGAVFSGITVYCVCDHRRKDVAVVQRKEKELTHSRRGSMSSvTKLSGL 185 

Qy 709 FGDTQSKDPKPEAILTPLMHNGKLATPGNT7VKMLIKADQHHLDLTALPTPESTPTLQQKR 768 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 186 FGDTQSKDPKPEAI LTPLMHNGKLATPGNTAKMLIKADQHHLDLT7VLPTPESTPTLQQKR 245 

Qy 769 KPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHE 828 

I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 24 6 KPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHE 305 

Qy 829 YVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASL 888 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I II 
Db 306 YVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASL 365 



QY 



889 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 94 8 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 



Db 



366 GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 425 



Qy 949 RNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 1008 

I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I 

Db 426 RNQSFGRGDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKP 485 

Qy 1009 DVPPKPSFAPLSTSMKPNDACT 1030 

II I I I I I I I I I I I I I I I I I I I I 

Db 486 DVPPKPSFAPLSTSMKPNDACT 507 



RESULT 9 
Q8NFY4 
ID Q8NFY4 
Q8NFY4; 
01-OCT-2002 
01-OCT-2002 
01-MAR-2003 



P REL I MI NARY ; 



PRT; 1073 AA. 



AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RC 
RA 
RT 
RL 
DR 
DR 
DR 
DR 
SQ 



(TrEMBLrel. 22, Created) 
(TrEMBLrel. 22, Last sequence update) 
(TrEMBLrel. 23, Last annotation update) 
Semaphorin 6D isoform 4. 
SEMA6D. 

Homo sapiens (Human) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
NCBI_TaxID=9606; 
[1] 

SEQUENCE FROM N.A. 
TISSUE=Brain; 

Qu X., Zhai Y. , Wei H., Yu Y., Tang F., He F. ; 

"Homo sapiens semaphorin 6D isoform 4 (SEMA6D.4) mRNA, complete cds."; 

Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases. 

EMBL; AF389429; AAM69452.1; 

InterPro; IPR001627; Sema. 

Pfam; PF01403; Sema; 1. 

SMART; SM00630; Sema; 1. 

SEQUENCE 1073 AA; 119872 MW; 7DCE4DFC5BF7 0F9E CRC64 ; 



Query Match 43.4%; Score 2368; DB 4; Length 1073; 

Best Local Similarity 46.0%; Pred. No. 7.5e-184; 

Matches 512; Conservative 159; Mismatches 308; Indels 134; 



Gaps 30; 



Qy 


l 


Db 


i 


Qy 


58 


Db 


59 


Qy 


118 


Db 


119 


Qy 


178 


Db 


179 


Qy 


238 



MRSEALLLYFTLL- 
II I I I I 



-HFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLD 57 
MM II:: : | : : I | I I I I : I I :| I III 



IQMIMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDEC 117 

I : :: : I I I I I I I : I I I : : : I : : I I I I I : I I I I : I II I I II I I I 

FQLMLKIRDTLYIAGRDQVYTVNLNEMPKTEVIPNKKLTWRSRQQDRENCAMKGKHKDEC 118 

HNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALF 17 7 

I I I I I I : : I I : : I I I I I I I I M I I I : : Ml I : I I I : I I I I : I I : I I I I I 

HNFIKVFVPRNDEMVFVCGTNAFNPMCRYYRLSTLEYDGEEISGLARCPFDARQTNVALF 17 8 

ADGKLYSATVTDFLAIDAVIYRSLGESPTLRTWHDSKWLKEPYFVQAVDYGDYIYFFFR 237 

llllllllll I I I I IMMM:|: | | | : | : | | | I : | I I : I : I : : I I : I : I I I I I 

7VDGKLYSATVADFLASDAVIYRSMGDGSALRTIKYDSKWIKEPHFLHAIEYGNYVYFFFR 238 

EIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQ 2 97 



I I I I I ' I • I I I • II I I II II II [ I II II I I I I I I I I i I I II i I i I I I I I • • I I 

239 EIAVEHNNLGKAVYSRVARICKNDMGGSQRVLEKHWTSFLKARLNCSVPGDSFFYFDVLQ 298 

298 AVTDVIRINGRDVVLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVP 357 

: : I I : I : I I I I : hi I I I I I I I I I I : I I I I I I I I I I I I : I I I I I I I 

2 99 SITDIIQINGIPTWGVFTTQLNSIPGSAVCAFSMDDIEKVFKGRFKEQKTPDSVWTAVP 358 

358 DERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 

: : : I I I I I I I I M I I II : I I I : I I : I I I : I I I I I III I : III : I M 

359 EDKVPKPRPGCCAKHGLAEAYKTSIDFPDETLSFIKSHPLMDSAVPPIADEPWFTKTRVR 418 

418 YRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGF-LNDSLFLEEMSVYNSEK 476 

I I I I I : I I : I I I I I I : I I : I : I I I I : : I I I I : I I I I I I : III: II I 

419 YRLTAI SVDHSAGPYQNYTVI FVGSEAGMVLKVLAK — TSPFSLNDSVLLEEIEAYNHAK 476 

477 CSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWI 536 

II : III::: : I I I : : I I I I I I : I : I : : I I I I I I : I I I I : I I I I I I I I I I I : 

477 CSAENEEDKKVI SLQLDKDHHALYVAFSSCI IRI PLSRCERYGSCKKSCIASRDPYCGWL 536 

537 KEGGACSHLSPNSRLT-FEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQE 595 

: I : I : : I : I I I I I I I I I I I I : | | : : I I I 
537 SQ-GSCGRVTPGMLAEGYEQDTEFGNTAHLGDCH EILPTSTTPD 579 

596 GYESRGG MLD-WKHLL DSP DSTDP 618 

I : I I :: I I : I II I I If 

580 -YKIFGGPTSDMEVSSSSVTTMASIPEITPKVIDTWRPKLTSSRKFWQDDPNTSDFTDP 638 

619 LGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFVMGAVFSGITVYCVCD-HR 677 

I : Ml I : I : I : : I I I I I : I I : I : I I I I 

639 LSGI P KGWWEVQS GESNQMVHMNVLI TCVFAAFVLGAFIAGVAVYC YRDMFV 691 

678 RKDVAWQRKEKELTHS RRGSMS S VTKLS GLFG DTQSKDPKPEAI LTPLMHNGKLA 733 

||:: 1:1 I I I I I : I I I : I I : I : I 

692 RKNRKI — HKDAESAQSCTDSSGSFAKLNGLFDSPVKEYQQNIDSPKLYSNLLTSRKELP 7 49 

734 TPGNT7VKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERNQNLINACTKDMPP 793 

I : I I : : : I II I I I I II I I I I : I : : : I Ml 

750 PNGDTKSMVMDHRGQPPELAALPTPESTPVLHQKTLQAMKSHSEKAHGH — GASRKETPQ 8 07 

794 MGSPVIPTDLPLRASPSHIPSWVLPITQQGYQHEY VDQP KMS 836 

I 1111111:111 I : : I I I 

808 FFPSSPPPHSPL — SHGHIPSAIVLPNATHDYNTSFSNSNAHKAEKKLQNIDHPLTKSSS 865 

837 EVAQMALEDQAATLEYKTIKEHLSSKSPN HGVNLVENLDSL PPKVP 882 

: | || : : I I : : I I ::: : |: Mill 

866 KRDHRRSVDSRNTL— NDLLKHLNDPNSNPKAIMGDIQMAHQNLMLDPMGSMSEVPPKVP 923 

883 QREASLGPPGASLSQTGLSKRLEMHHSSSY GVDYKRSYPTNSLTRSHQATTLKRNNT 939 

I I I I I I : : I : : I I : : : : : : : I I II II : : : I 

924 NREASLYSPPSTLPRNSPTKRVDVPTTPGVPMTSLERQRGYHKNSSQR-HSISAMPK-NL 981 

940 NSSNSSHLSRNQSFGRGD-NPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSLNAYNSLT — 996 

Ml III I II II : I I I I I I : I MM: : I 

982 NSPNGVLLSRQPSMNRGGYMPTPTGAKVDYIQ GT P VS VHLQ PSLSRQSSYTSN 1034 

997 RSGLKRTPSLKPDVPPKPSFAPLSTSMKP 1025 

I : I I M I I I I II I II II II I I : MM 



Db 1035 GTLPRTGLKRTPSLKPDVPPKPSFVPQTPSVRP 1067 



RESULT 10 
Q8NFY5 

ID Q8NFY5 PRELIMINARY; PRT; 1017 AA. 

AC Q8NFY5; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Semaphorin 6D isoform 3. 

GN SEMA6D. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBIJTaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Brain; 

RA Qu X., Wei H., Zhai Y., Yu Y., Tang F., He F. ; 

RT "Homo sapiens semaphorin 6D isoform 3 (SEMA6D.3) mRNA, complete cds . " ; 

RL Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF389428; AAM69451.1; 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00630; Sema; 1. 

SQ SEQUENCE 1017 AA; 113736 MW; 4D639CEBADD9F2A0 CRC64; 

Query Match 43.1%; Score 2349; DB 4; Length 1017; 

Best Local Similarity 46.7%; Pred. No. 2.5e-182; 

Matches 503; Conservative 155; Mismatches 302; Indels 116; Gaps 27; 

Qy 1 MRSEALLLYFTLL HFAGAGFPEDSEPI SISHGNYTKQYPVFVGHKPGRNTTQRHRLD 57 

II I I II I I I I I I : : : I : : I I I I I I : I I : I I I I I 

Db 1 MRVFLLCAYILLLMVSQLRAVSFPEDDEPLNTVDYHYSRQYPVFRG-RPSGNESQ-HRLD 58 

Qy 58 IQMIMIMNGTLYI7^ARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDEC 117 

I : : : : I I I I I I I : I I I : : : I : : I II I I : I I I I : I I I I I I I I I I 

Db 59 FQLMLKI RDTLYIAGRDQVYTVNLNEMPKTEVT PNKKLTWRSRQQDRENCAMKGKHKDEC 118 

Qy 118 HNFIKVljLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALF 177 

I I I I I I : : M : : I I I I I I I I I I II I : : III I : I I I : I I I I : II : I I I I I 
Db 119 HNFIKVFVPRNDEMVFVCGTNAFNPMCRYYRLSTLEYDGEEISGLARCPFDARQTNVALF 17 8 

Qy 178 ADGKLYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFR 237 

I I I I I I I I I I I I I I I I I I I I I : I : I I I : I : I I I I : I I I : I : I : : I I : I : I I I I I 
Db 179 ADGKLYSATV7VDFLASDAVIYRSMGDGSALRTIKYDSKWIKEPHFLHAIEYGNYVYFFFR 238 

Qy 238 EIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQ 297 

I I I I I : I : I I I : I I I : : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : I I 
Db 239 EIAVEHNNLGKAVYSRVARICKNDMGGSQRVLEKHWTSFLKARLNCSVPGDSFFYFDVLQ 298 

Qy 298 AVTDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVP 357 

: : I I : I : I I I I : I : I I I I I I I I I I I : I II II I I I I I I I : I I I II II 
Db 299 SITDIIQINGIPTWGVFTTQLNSIPGSAVCAFSMDDIEKVFKGRFKEQKTPDSVWTAVP 358 

Qy 358 DERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 



::: I II I I I I I I I I I II : I I I : I I : I I I : I I I I I III I : III : I II 

Db 359 EDKVPKPRPGCCAKHGLAEAYKTSIDFPDETLSFIKSHPLMDSAVPPIADEPWFTKTRVR 418 

Qy 418 YRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGF-LNDSLFLEEMSVYNSEK 476 

I I I I I : I I : I I M I I : I I : I : I I I I : : I I I I : I I I I I I : III: II I 

Db 419 YRLTAISVDHSAGPYQNYTVT FVGSEAGMVLKVLAK — TSPFSLNDSVLLEEIEAYNHAK 476 

Qy 477 CS YDGVEDKRIMGMQLDRAS S SLYVAFSTCVT KVPLGRCERHGKCKKTCIAS RDPYCGWI 536 

II : III::: : I I I : : I I I I I I : I : I : : I I I I I I : I I I I : I I I I I I I I I I I : 

Db 477 CSAENEEDKKVISLQLDKDHHALYVAFSSCIIRIPLSRCERYGSCKKSCIASRDPYCGWL 536 

Qy 537 KEGGACSHLSPNSRLT-FEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQE 595 

: I : I : : I : I I I I I I I I I I I I : I I : : I I I 
Db 537 SQ-GSCGRWPGMLAEGYEQDTEFGNTAHLGDCH EILPTSTTPD 579 

Qy 596 GYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAV 655 

I : I I II III : | : | : : | | 

Db 580 -YKIFGG PTS GVRW EVQ S GE S NQMVHMNVL I T CV 612 

Qy 656 ILAFVMGAVFSGITVYCVCD-HRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFG 710 

111:11 : I : I I I I II : : I : I I I I I I : I I I 
Db 613 FAAFVL GAFI AGVAVYC YRDMFVRKNRK I — HKDAESAQSCTDSSGSFAKLNGLFDSPVK 670 

Qy 711 DTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKP 770 

: I I : I : I I : I I : : : I I I I II I I I I I I I 

Db 671 EYQQNIDSPKLYSNLLTSRKELPPNGDTKSMVMDHRGQPPELAALPTPESTPVLHQKTLQ 7 30 

Qy 771 SRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEY- 829 

: I : : : I I : I I I I I I I I I : I I I I : 

Db 731 AMKSHSEKAHGH — GASRKETPQFFPSSPPPHSPL — SHGHI PSAIVLPNATHDYNTSFS 786 

Qy 830 VDQP KMSEVAQMALEDQAATLEYKTIKEHLSSKSPN 865 

: I I I : I II : : | | : : | 

Db 787 NSNAHKAEKKLQNIDHPLTKSSSKRDHRRSVDSRNTL--NDLLKHLNDPNSNPKAIMGDI 844 

Qy 866 HGVNLVENLDSL PPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSY GVDY 916 

I : : : : I : I I I I I I I I I I I : : I : : I I : : : : : : 

Db 845 QMAHQN LML D PMG SMS EVP P KVPN REAS LYSPPSTLPRNSPTK RVD VP TT P GVPMT S L E R 904 

Qy 917 KRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGD-NPPPAPQRVDSIQVHSSQ 975 

: I I II II : : : I I I I III I II II : I I I I 
Db 905 QRGYHKNSSQR-HSISAMPK-NLNSPNGVLLSRQPSMNRGGYMPTPTGAKVDYIQ 957 

Qy 976 PSGQAVTVSRQPSLNAYNSLT RSGLKRTPSLKPDVPPKPSFAPLSTSMKP 1025 

I 1:1 MM: : I I I : I I I I I I I I I I I I I I I I I I I : I :: I 

Db 958 — GTPVSVHLQPSLSRQSSYTSNGTLPRTGLKRTPSLKPDVPPKPSFVPQTPSVRP 1011 

RESULT 11 
Q8NFY6 

ID Q8NFY6 PRELIMINARY; PRT; 998 AA. 

AC Q8NFY6; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Semaphorin 6D isoform 2. 

GN SEMA6D. 



OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Qu X., Zhai Y., Wei H., Yu Y. , Tang F. , He F.; 

RT "Homo sapiens semaphorin 6D isoform 2 (SEMA6D.2) mRNA, complete cds."; 

RL Submitted (JUN-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF389427; AAM69450.1; -. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00630; Sema; 1. 

SQ SEQUENCE 998 AA; 111730 MW; 3F4 6D6872E8D534 4 CRC64; 



Query Match 42.5%; Score 2315.5; DB 4; Length 998; 

Best Local Similarity 45.8%; Pred. No. 1.3e-179; 

Matches 493; Conservative 155; Mismatches 293; Indels 135; Gaps 26; 

Qy 1 MRSEALLLYFTLL HFAGAGFPEDSEPISI SHGNYTKQYPVFVGHKPGRNTTQRHRLD 57 

II I I II I I I I I I :: : I : : I I I I I I : I I : I I I I I 

Db 1 MRVFLLCAYILLLMVSQLRAVSFPEDDEPLNTVDYHYSRQYPVFRG-RPSGNESQ-HRLD 58 

Qy 58 IQMIMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDEC 117 

I : : : : I I I I I I I : I I I : : : I : : I M I I : I I I I : I I I I I I I I I I 

Db 59 FQLMLKIRDTLYIAGRDQVYTVNLNEMPKTEVIPNKKLTWRSRQQDRENCAMKGKHKDEC 118 

Qy 118 HNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALF 177 

I I I I I I : : I I : : I I I I I I I II I II I : : III hi II : I I I I : I I : I I I I I 
Db 119 HNFIKVFVPRNDEMVFVCGTNAFNPMCRYYRLSTLEYDGEEISGLARCPFDARQTNVALF 178 

Qy 178 ADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFR 237 

I I I I I I I I I I I I I I I I I I I I I : I : I I I : I : I I I I : I I I : I : I : : I I : I : I I I I I 
Db 179 ADGKLYSATVADFLASDAVI YRSMGDGSALRTI KYDSKWI KEPHFLHAI EYGNYVYFFFR 238 

Qy 238 EIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQ 297 

I I I I I : I : I I I : I I I : : I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I : : I I 
Db 239 EIAVEHNNLGKAVYSRVARICKNDMGGSQRVLEKHWTSFLKARLNCSVPGDSFFYFDVLQ 298 

Qy 298 AVTDVI RINGRDVVLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVP 357 

::||:|:||| |: |:| I I II I I I I I I : I II I I I I I I I I I : I I I I I I I 
Db 299 S ITDI IQINGI PTVVGVFTTQLNS I PGSAVCAFSMDDI EKVFKGRFKEQKTPDSVWTAVP 358 

Qy 358 DERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 

::: I I I I I I I I I I I I II : I I I : I I : I I I : I I I I I III I : III : I II 

Db 359 EDKVPKPRPGCCAKHGLAEAYKTSIDFPDETLSFIKSHPLMDSAVPPIADEPWFTKTRVR 418 

Qy 418 YRLTKI AVDTAAGP YQNHT WFLGS EKGI I LKFLARI GNS GF- LNDS LFLEEMS VYNS EK 476 

I I I I I : I I : I I I I I I : I I : I : I I I I :: I I I I : I I I I I I : I I I : II I 

Db 419 YRLTAI SVDHSAGPYQNYTVI FVGSEAGMVLKVLAK — TSPFSLNDSVLLEEIEAYNHAK 476 

Qy 477 CSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWI 536 

II : III::: : I I I : : I I I II I : I : I : : I I I II I : I I I I : I I I I I I I I I I I : 

Db 477 CSAENEEDKKVISLQLDKDHHALYVAFSSCIIRIPLSRCERYGSCKKSCIASRDPYCGWL 536 



Qy 



537 KEGGACSHLSPNSRLT-FEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQE 595 



Db 537 SQ-GSCGRVTPGMLAEGYEQDTEFGNTAHLGDCHG 570 

Qy 596 GYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RE5YLKGHDQLVPVTLLAIAV 655 

: I : : I : I : I : I : : I I 

Db 571 VRWE— VQSGES NQMVHMNVL I T CV 593 

Qy 656 ILAFVMGAVFSGITVYCVCD-HRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFG 710 

111:11 : I : I I I I I I : : I : I I I I I I : I I I 
Db 594 FAAFVLGAF I AGVAVYC YRDM FVRKN RK I — HKDAESAQSCTDSSGSFAKLNGLFDSPVK 651 

Qy 711 DTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKP 770 

: I I : I : I hi I : : : I I I I I I I I I I I I I 

Db 652 EYQQNIDSPKLYSNLLTSRKELPPNGDTKSMVMDHRGQPPELAALPTPESTPVLHQKTLQ 711 

Qy 771 SRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEY- 829 

: I : : : I I : I I I I I I I I I : I I I I : 

Db 712 AMKSHSEKAHGH— GASRKETPQFFPSSPPPHSPL — SHGHI PSAI VLPNATHDYNTS FS 767 

Qy 830 VDQP KMS EVAQMALEDQAATLE YKT I KEHLS S KS PN 8 65 

: I I I : I II : : | | : : | 

Db 768 NSNAHKAEKKLQNIDHPLTKSSSKRDHRRSVDSRNTL — NDLLKHLNDPNSNPKAIMGDI 825 

Qy 866 HGVNLVENLDSL PPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSY GVDY 916 

I I I I I I I I I I I I : : I : : I I : : : : : : 

Db 82 6 QMAHQNLMLDPMGSMSEVPPKVPNREASLYSPPSTLPRNSPTKRVDVPTTPGVPMTSLER 8 85 

Qy 917 KRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGD-NPPPAPQRVDSIQVHSSQ 975 

: I I II II : : : I I I I III I II II : I I I I 
Db 886 QRGYHKNSSQR-HSISAMPK-NLNSPNGVLLSRQPSMNRGGYMPTPTGAKVDYIQ 938 

Qy 976 PSGQAVTVSRQPSLNAYNSLT RSGLKRTPSLKPDVPPKPSFAPLSTSMKP 1025 

I I : I MM: :|| I : I I I I I I I II I I I I I II I I I : I :: I 

Db 939 — GTPVSVHLQPSLSRQSSYTSNGTLPRTGLKRTPSLKPDVPPKPSFVPQTPSVRP 992 



RESULT 12 
Q8NFY3 

ID Q8NFY3 PRELIMINARY; PRT; 1011 AA. 

AC Q8NFY3; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel . 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Semaphorin 6D isoform 1. 

GN SEMA6D. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Brain; 

RA Qu X., Wei H., Zhai Y., Yu Y., Tang F., He F. ; 

RT "Homo sapiens semaphorin 6D isoform 1 (SEMA6D.1) mRNA, complete cds . " ; 

RL Submitted (JUN-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF389430; AAM69453.1; 

DR InterPro; IPR001627; Sema. 



DR Pfam; PF01403; Sema; 1. 
DR SMART; SM00630; Sema; 1. 

SQ SEQUENCE 1011 AA; 113289 MW; 9D6B8B363394 1B89 CRC64 ; 



Query Match 42.5%; Score 2314; DB 4; Length 1011; 

Best Local Similarity 45.4%; Pred. No. 1.8e-179; 

Matches 494; Conservative 155; Mismatches 292; Indels 148; Gaps 26; 

Qy 1 MRSEALLLYFTLL HFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLD 57 

II I I I I I I I I I I : : : I : : I I I I I I : I I : I I I I I 

Db 1 MRVFLLCAYILLLMVSQLRAVSFPEDDEPLNTVDYHYSRQYPVFRG-RPSGNESQ-HRLD 58 

Qy 58 IQMIMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDEC 117 

I : : : : I I I I I I I : I I I : : : I : : I I I I I : I I I I : I I I I I I I I I I 

Db 59 FQLMLKIRDTLYIAGRDQVYTVNLNEMPKTEVIPNKKLTWRSRQQDRENCAMKGKHKDEC 118 

Qy 118 HNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALF 177 

I I I I I I : : I I : : I I I I II I I I I II I : : III I : I I I : I I I I : I I : I I I I I 
Db 119 HNFIKVFVPRNDEMVFVCGTNAFNPMCRYYRLSTLEYDGEEISGLARCPFDARQTNVALF 178 

Qy 178 ADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFR 237 

I I I II I I I I I I II I I I I I I I I : i : I I I : I : I I I I : I I I : I : I : : I I : I : I I I I I 
Db 179 ADGKLYSATVADFLASDAVIYRSMGDGSALRTIKYDSKWIKEPHFLHAIEYGNYVYFFFR 238 

Qy 238 EIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLK7VRLNCSVPGDSHFYFNILQ 297 

11111:1 : I I I : I I I : : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : I I 
Db 239 EIAVEHNNLGKAVYSRVARI CKNDMGGSQRVLEKHWTS FLKARLNCSVPGDS FFYFDVLQ 298 

Qy 298 AVTDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVP 357 

: : M : I : I I I I : I : I I II I I I I I I I : I II II I I I I I I I : I I I II II 
Db 299 SITDIIQINGIPTWGVFTTQLNSIPGSAVCAFSMDDIEKVFKGRFKEQKTPDSVWTAVP 358 

Qy 358 DERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 

::: I I I I I I I I I I I I II : I I I : I I : I I I : I I I I I III I : Ml : I II 

Db 359 EDKVPKPRPGCCAKHGLAEAYKTS I DFPDETLS FI KSHPLMDSAVPPI ADEPWFTKTRVR 418 

Qy 418 YRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGF-LNDSLFLEEMSVYNSEK 476 

I I I I I : I I : I I I I I I : I I : I : I II I : : II I I : I I I I I I : III: II I 

Db 419 YRLTAISVDHSAGPYQNYTVI FVGSEAGMVLKVLAK — TSPFSLNDSVLLEEIEAYNHAK 476 

Qy 477 CSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWI 536 

II : III::: :lll: : I I I I I I : I : I : : I I I I I I : I I I I : I I I I I I I I I I I : 

Db 477 CSAENEEDKKVISLQLDKDHHALYVAFSSCIIRIPLSRCERYGSCKKSCIASRDPYCGWL 536 

Qy 537 KEGGACSHLSPNSRLT FEQDI ERGNTDGLGDCHNS FVALNGHSS S L 582 

: I : I : : I I : I I I I I I I I I I I I 
Db 537 SQ-GSCGRVTPGMLLLTEDFFAFHNHSAEGYEQDTEFGNTAHLGDCHG 583 

Qy 583 LPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGH 642 

: I : : I : I 

Db 584 VRWE — VQSGES 593 

Qy 643 DQLVPVTLLAIAVILAFVMGAVFSGITVYCVCD-HRRKDVAWQRKEKELTHSRRGSMSS 701 

: I : I : : I I 111 = 11 : I : I I I I I I : : I : I I I I 
Db 594 NQMVHMNVLITCVF7\AFVLGAFIAGVAVYCYRDMFVRKNRKI — HKDAESAQSCTDSSGS 651 

Qy 702 VTKLSGLFG DTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPT 757 



11*111 • I I • I - I I • I I • • * I I I I I 

Db 652 FAKLNGLFDSPVKEYQQNIDSPKLYSNLLTSRKELPPNGDTKSMVMDHRGQPPELAALPT 711 

Qy 758 PESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSWV 817 

I I I I I I I I : I : : : I I : I I I I I I I I I : I 

Db 712 PESTPVLHQKTLQAMKSHSEKAHGH — GASRKETPQFFPSSPPPHSPL — SHGHIPSAIV 767 

Qy 818 LPITQQGYQHEY VDQP KMSEVAQMALEDQAATLEYKTIKEHLS 860 

II I : ' I I I: III : :lh 

Db 768 LPNATHDYNTSFSNSNAHKAEKKLQNIDHPLTKSSSKRDHRRSVDSRNTL — NDLLKHLN 825 

Qy 861 SKSPN HGVNLVENLDSL PPKVPQREASLGPPGASLSQTGLSKRLEM 906 

: I I : : : : I : I I II I I I I I I I : : I : : I I : : : 

Db 826 DPNSNPKAIMGDIQMAHQNLMLDPMGSMSEVPPKVPNREASLYSPPSTLPRNSPTKRVDV 885 

Qy 907 HHSSSY GVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGD-NPPPA 962 

: : : : | | | | | | : : : | | I I I I I I I I II 

Db 886 PTTPGVPMTSLERQRGYHKNSSQR-HSISAMPK-NLNSPNGVLLSRQPSMNRGGYMPTPT 943 

Qy 963 PQRVD S I QVH S S Q P S GQAVTVS RQ P S LNAYN S LT RSGLKRTPSLKPDVPPKPSF 1016 

: I I I I I I : I MM: : I I I : II M I II I II I I I I I I I I 

Db 944 GAKVDYIQ GTPVSVHLQPSLSRQSSYTSNGTLPRTGLKRTPSLKPDVPPKPSF 996 

Qy 1017 APLSTSMKP 1025 

I : MM 

Db 997 VPQTPSVRP 1005 



RESULT 13 
Q9P249 

ID Q9P249 PRELIMINARY; PRT; 1022 AA. 

AC Q9P249; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2001 (TrEMBLrel. 18, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel . 24, Last annotation update) 

DE Hypothetical protein KIAA1479 (Fragment) . 

GN KIAA1479. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20277482; PubMed=l 0819331; 

RA Nagase T., Kikuno R. , Ishikawa K. , Hirosawa M. , Ohara O.; 

RT "Prediction of the coding sequences of unidentified human 

RT genes .XVII .The complete sequences of 100 new cDNA clones from brain 

RT which code for large proteins in vitro."; 

RL DNA Res. 7:143-150(2000). 

DR EMBL; AB040912; BAA96003.2; -. 

DR GO; GO: 0007275; P : development ; IEA. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

KW Hypothetical protein. 



FT NON_TER 1 1 

SQ SEQUENCE 1022 AA; 114372 MW; BE4FBD5EA02C69C4 CRC64; 



Query Match 42.5%; Score 2314; DB 4; Length 1022; 

Best Local Similarity 45.4%; Pred. No. 1.8e-179; 

Matches 494; Conservative 155; Mismatches 292; Indels 148; Gaps 26; 

Qy 1 MRSEALLLYFTLL HFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLD 57 

II I I II I I II I I : : : I : : I I I I I I : I I : I I I I I 

Db 12 MRVFLLCAYILLLMVSQLRAVSFPEDDEPLNTVDYHYSRQYPVFRG-RPSGNESQ-HRLD 69 

Qy 58 IQMIMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDEC 117 

I : : : : I I I I I I I : I I I : : : I : : I I I I I : I I I I : I I I I I I I I I I 

Db 70 FQLMLKIRDTLYIAGRDQVYTVNLNEMPKTEVIPNKKLTWRSRQQDRENCAMKGKHKDEC 129 

Qy 118 HNFIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALF 177 

I I I I I I : : I I : : I I I I I I I I I I II I : : III I : I II : I I I I : I I : I I I I I 
Db 130 HNFIKVFVPRNDEMVFVCGTNAFNPMCRYYRLSTLEYDGEEISGL7VRCPFDARQTNVALF 189 

Qy 178 ADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFR 237 

I I I I I II I M I I I I I I I I II I : I : I I I : I : I I I I : I I I : I : I :: I I : I : M I I I 
Db 190 ADGKLYSATVADFLASDAVI YRSMGDGSALRTI KYDS KWI KEPHFLHAI EYGNYVYFFFR 249 

Qy 238 EIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQ 297 

I I I I I : I : I I I : I I I -: : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I : : I I 

Db 250 EIAVEHNNLGKAVYSRVARICKNDMGGSQRVLEKHWTSFLKARLNCSVPGDSFFYFDVLQ 309 

Qy 298 AVTDVI RINGRDVVLATFSTP YNS I PGSAVCAYDMLDIASVFTGRFKEQKS PDSTWTPVP 357 

: : I I : I : I I I I : I : I I I I I I I I I I I : I I I I I I I I I I I I : I I I I I I I 
Db 310 S ITDI IQINGI PTWGVFTTQLNS I PGSAVCAFSMDDI EKVFKGRFKEQKTPDS WTAVP 369 

Qy 358 DERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 

::: I I I I II I I I I I I II : I I I : I I : I I I : I I I I I III I : III : I I I 

Db 370 EDKVPKPRPGCCAKHGLAEAYKTSIDFPDETLSFIKSHPLMDSAVPPIADEPWFTKTRVR 429 

Qy 418 YRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGF-LNDSLFLEEMSVYNSEK 476 

I I I I I : I I : I I I I I I : I I : I : I I I I : : I I I I : I I I I I I : I I I : II I 

Db 430 YRLTAISVDHSAGPYQNYTVI FVGSEAGMVLKVLAK — TSPFSLNDSVLLEEIEAYNHAK 487 

Qy 477 CSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWI 536 

II : III::: : I I I : : I I I I I I : I : I : : I I I I I I : I I I I : I I I I I I I I I I I : 

Db 488 CSAENEEDKKVISLQLDKDHHALYVAFSSCIIRIPLSRCERYGSCKKSCIASRDPYCGWL 547 

Qy 537 KEGGACSHLSPNSRLT FEQDIERGNTDGLGDCHNSFVALNGHSSSL 5 82 

: I : I :: I I : I I I I II I I I I I I 
Db 548 SQ-GSCGRVTPGMLLLTEDFFAFHNHSAEGYEQDTEFGNTAHLGDCHG 594 

Qy 583 LPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGH 642 

: I : : I : I 

» Db 595 VRWE — VQSGES 604 

Qy 643 DQLVPVTLLAIAVILAFVMGAVFSGITVYCVCD-HRRKDVAWQRKEKELTHSRRGSMSS 7 01 

: I : I : : I I I I I : I I : I : I I I I I I : : I : I I I I 
Db 605 NQMVHMNVL I T CVFAAFVLGAFI AGVAVYC YRDMFVRKNRK I — HKDAESAQSCTDSSGS 662 

Qy 702 VTKLSGLFG DTQSKDPKPEAI LT PLMHNGKLAT PGNTAKMLIKADQHHLDLTALPT 757 

||:||| : I I: I :| hi |:: * I I I I I 



Db 



663 FAKLNGLFDSPVKEYQQNIDSPKLYSNLLTSRKELPPNGDTKSMVMDHRGQPPELAALPT 722 



Qy 758 PESTPTLQQKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVW 817 

I I I I I I I I : I : : : I hi I II I I I I I M 

Db 723 PESTPVLHQKTLQAMKSHSEKAHGH — GASRKETPQFFPSSPPPHSPL — SHGHIPSAIV 778 

Qy 818 LPITQQGYQHEY VDQP KM S EVAQMAL E DQAAT LEYKTIKEHLS 860 

II I : : I I I : III : : I I : 

Db 779 LPNATHDYNTSFSNSNAHKAEKKLQNIDHPLTKSSSKRDHRRSVDSRNTL — NDLLKHLN 836 

Qy 861 SKSPN HGVNLVENLDSL PPKVPQREASLGPPGASLSQTGLSKRLEM 906 

: I I : : : : I : I I II I I I I I I I : : I : : I I : : : 

Db 837 DPNSNPKAIMGDIQMAHQNLMLDPMGSMSEVPPKVPNREASLYSPPSTLPRNSPTKRVDV 896 

Qy 907 HHSSSY GVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGD-NPPPA 962 

: : : : I I I I I I : : : I I I I I I I I I I II 

Db 897 PTTPGVPMTSLERQRGYHKNSSQR-HSISAMPK-NLNSPNGVLLSRQPSMNRGGYMPTPT 954 

Qy 963 PQRVD S I QVH S S Q P S GQAVT VS RQ P S LN AYN S LT RSGLKRTPSLKPDVPPKPSF 1016 

Mill I I : I MM: Ml I M I I I II I II II I M M I I 

Db 955 GAKVDYIQ GTPVSVHLQPSLSRQSSYTSNGTLPRTGLKRTPSLKPDVPPKPSF 1007 

Qy 1017 APLSTSMKP 1025 

I : I : M 

Db 1008 VPQTPSVRP 1016 



RESULT 14 
Q80TD0 

ID Q80TD0 PRELIMINARY; PRT; 1009 AA. 

AC Q80TD0; 

DT 01-JUN-2003 (TrEMBLrel . 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE MKIAA1479 protein (Fragment). 

GN MKIAA1479. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=2257 9291; PubMed=12 693553 ; 

RA Okazaki N., Kikuno R., Ohara R. , Inamoto S., Aizawa H., Yuasa S., 

RA Nakajima D. , Nagase T., Ohara O., Koga H.; 

RT "Prediction of the coding sequences of mouse homologues of KIAA gene: 

RT II. The complete nucleotide sequences of 400 mouse KIAA-homologous 

RT cDNAs identified by screening of terminal sequences of cDNA clones 

RT randomly sampled from size-fractionated libraries."; 

RL DNA Res. 10:35-4 8(2 003). 

DR EMBL; AK122515; BAC65797.1; -. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00630; Sema; 1. 

FT NON__TER 1 1 

SQ SEQUENCE 1009 AA; 112808 MW; 7509F0B67332316B CRC64; 



Query Match 42.4%; Score 2309.5; DB 11; Length 1009; 

Best Local Similarity 46.1%; Pred. No. 4.1e-179; 

Matches 497; Conservative 154; Mismatches 280; Indels 147; Gaps 



30; 



Qy 6 LLLYFTLL HFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

I I I : I : I I I I I I I : : : I : : I I I I I I : I I : I I I I I I : 

Db 15 LLLWFCVLFLLVSRLRAVSFPEDDEPLNTVDYHYSRQYPVFRG-RPSGNESQ-HRLDFQL 72 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

:: : I I I I I I I : I I I : : : I : I I M I I : I I I I : I I I I I I I I I I I I I 

Db 73 MLKIRDTLYIAGRDQVYTVNLNEIPQTEVIPSKKLTWRSRQQDRENCAMKGKHKDECHNF 132 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANV7VLFADG 180 

III : : I I : : I I I I I I I II I II I : : III hi I I : I I I I : I I : I I I I I I I I 

Db 133 IKVFVPRNDEMVFVCGTNAFNPMCRYYRLRTLEYDGEEISGLARCPFDARQTNVALFADG 192 

Qy 181 KLYSATVTDFLAI DAVI YRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I 1111111:1: I I I : I : I I I I : I I I : I : I : : I I : I : I I I I I I I I 

Db 193 KLYSATVADFLASDAVIYRSMGDGSALRTIKYDSKWIKEPHFLHAIEYGNYVYFFFREIA 252 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I : I : I I I : I I I : : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : I I : : I 

Db 253 VEHNNLGKAVYSRVARI CKNDMGGSQRVLEKHWTS FLKARLNCSVPGDS FFYFDVLQS IT 312 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I : I : I I I I : I : I 1111111111:111 I I I I I I I I I : I I I I I I I : : : 

Db 313 DIIQINGIPTWGVFTTQLNSIPGSAVCAFSMDDIEKVFKGRFKEQKTPDSVWTAVPEDK 372 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I II : I I I I I I llhlllll III I : III : I I I I I I 

Db 373 VPKPRPGCCAKHGLAEAYKTSIDFPDDTLAFIKSHPLMDSAVPPIADEPWFTKTRVRYRL 4 32 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGF-LNDSLFLEEMSVYNSEKCSY 479 

I I I I : I I I I I I : I I : I : I I I I :: I I I I : I I I I I I : I I I : II III 

Db 433 TAIEVDRSAGPYQNYTVIFVGSEAGWLKVLAK — T S P FS LNDS VLLEEI EAYN PAKCSA 490 

Qy 480 DGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEG 539 

: II:::: MM: : I I I I I I : M : : : I I INI: I I I : I I I I I I I I I I I : : 

Db 491 ESEEDRKWSLQLDKDHHALYVAFSSCWRIPLSRCERYGSCKKSCIASRDPYCGWLSQ- 549 

Qy 540 GACSHLS PNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQE 595 

II : : I : I II I I I I I I I I I 

Db 550 GVCERVTLGMLPGG YEQDTEYGNTAHLGDCHG 581 

Qy 596 GYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAV 655 

: I: : I :| :|:| : :| I 

Db 582 VRWE — VQSGES NQMVHMNVL I T CV 604 

Qy 656 I ]^FVMGAVFS GI T WCVCD-HRRKDVAWQRKEKELTHS RRGSMS S VTKLS GLFGDTQS 714 

I I I : I I : I : I II I I I : : hi I I I I h I I I h 

Db 605 FAAFVLGAFIAGVAVYCYRDMFVRKNRKI — HKDAESAQSCTDSSGSFAKLNGLF-DSPV 661 

Qy 715 KDPKPEAILTPLMHNGKLAT PGNTAKMLIKADQHHL — DLTALPTPESTPTLQQKR 768 

I : : : I : I : : : I : III : I : I I I I I II I I I I I I 

Db 662 KEYQ-QNIDSPKLYSNLLTSRKELPPNTDTKSMAVDHRGQPPELAALPTPESTPVLHQKT 720 



769 KPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSWVLPITQQGYQHE 828 

: I : : : I I : I I I I I I I I I : I I I I 

721 LQAMKSHSEKAHSH--GASRKEHPQFFPSSPPPHSPL— SHGHIPSAIVLPNATHDYNTS 776 

82 9 Y VDQP KMSEVAQMALEDQAATLEYKTIKEHLSSKSPN 8 65 

: : I I I : III : : I | : : I 

777 FSNSNAHKAEKKLQSMDHPLTKSSSKREHRRSVDSRNTL — NDLLKHLNDPNSNPKAILG 834 

866 — HGWLVENLD SLPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSY GV 914 

I : II : I I I I I I I I I I I : : I : : I I : : : : : 

835 EIHMAHQTLMLDPVGPMAEVPPKVPNREASLYSPPSTLPRNSPTKRVDVPTTPGVPMTSL 894 

915 DYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGD-NPPPAPQRVDSIQVHS 973 

: : I I I I II : : : I I I | III I II II : I I I I 
895 ERQRGYHKNSSQR-HSISAVPK-NLNSPNGVLLSRQPSMNRGGYMPTPTGAKVDYIQ 949 

974 SQPSGQAVTVSRQPSLNAYNSLT RSGLKRTPSLKPDVPPKPSFAPLSTSMKP 1025 

I I : I MM: : | | | : | | | I I II I I II I I I I I II I M I : : I 

950 GTPVSVHLQPSLSRQSSYTSNGTLPRTGLKRTPSLKPDVPPKPSFVPQTTSVRP 1003 



RESULT 15 




Q96JU9 




ID 


Q96JU9 PRELIMINARY; PRT; 418 AA. 




AC 


Q96JU9; 




DT 


01-DEC-2001 (TrEMBLrel. 19, Created) 




DT 


01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 




DT 


01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 




DE 


Hypothetical protein FLJ14961. 




OS 


Homo sapiens (Human) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 


OC 


Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 




OX 


NCBI TaxID=9606; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


TISSUE=Placenta; 




RA 


Isogai T., Ota T., Hayashi K., Sugiyama T., Otsuki T., Suzuki 


Y., 


RA 


Nishikawa T., Nagai K., Sugano S., Takahashi-Fuj ii A., Hara H 




RA 


Tanase T., Nomura Y., Togiya S., Komai F., Hara R. , Takeuchi 


K., 


RA 


Arita M. , Nabekura T., Ishii S., Kawai Y., Saito K., Yamamoto 




RA 


Wakamatsu A., Nakamura Y., Nagahari K. , Masuho Y. , Oshima A.; 




RT 


"NEDO human cDNA sequencing project."; 




RL 


Submitted (MAY-2001) to the EMBL/GenBank/DDBJ databases. 




DR 


EMBL; AK027867; BAB55418.1; -. 




DR 


InterPro; IPR001627; Sema. 




DR 


Pfam; PF01403; Sema; 1. 




DR 


SMART; SM0 0630; Sema; 1. 




KW 


Hypothetical protein. 




SQ 


SEQUENCE 418 AA; 47415 MW; 2E4 9A3F6C0564 307 CRC64; 





Query Match 40.6%; Score 2213; DB 4; Length 418; 

Best Local Similarity 100.0%; Pred. No. 7.3e-172; 

Matches 415; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 62 MIMNGTLYIAARDHI YTVDIDTSHTEEI YCSKKLTWKSRQADVDTCRMKGKHKDECHNFI 121 

I I II I II I II I I II M I I I II I II I I II I II I II M II I I M II I II I I I II I II I II I I 

Db 1 MIMNGTLYIAARDHI YTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFI 60 



QY 
Db 

Qy 

Db 

QY 
Db 

QY 
Db 

QY 
Db 



Qy 


122 


KVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGK 


181 




1 I I I I I 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 




Db 


OX 


K\H T PTKMnDAT FVrnTNAFNPSCRNYKMnTLFPFGDF.FSGN^^ 


120 


Qy 


182 


LYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAV 


241 




M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 




Db 




T ZiTArrnFT ZXTnAVT YR ST.f^F^PTT.RTVKHfl^KWT.KFPYFVOAVDYGDYI YFFFREIAV 


180 


Qy 


242 


EYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTD 


301 




I I 1 I 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 > 1 1 1 1 1 1 1 




Db 


1 0 1 


T?VTvT r PM , r*T^\A7'TrDDA/ZiriA/PT^Mnivrcn c inR'\/T FfcTPtT/JT^FT K"ART/NPSVP^n^HFYFNT T.OAVTD 


240 


Qy 


302 


VIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERV 


361 




I I 1 I I 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




Db 


X 


uTDTMrDmnn 1 aTPQTDVMciT pn^ A VP A YTiMT HT A ^VFTHR FKFOKS PD iTWTPVPDF.RV 


300 


Qy 




PtfPPPrrrarci^ciT FRYAT^NFFPnTYFT.NFT KTHPT.MDFAVPSIFNRPWFLRTMVRYRLT 


421 




I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 




Db 


301 


P K P RP GC GAG S S S L E RYAT SNEFPDDTLNFIKTHP LMD EAVP S I FN RP W FL RTMVR YRLT 


360 


Qy 


422 


KIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEK 47 6 






I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 




Db 


361 


KIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEK 415 





Search completed: March 24, 2004, 13:16:25 
Job time : 75.9691 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 
Perfect score: 5450 



March 24, 2004, 13:07:38 ; Search time 20.5626 Seconds 

(without alignments) 
2608.241 Million cell updates/sec 

US-09-856-681A-2 



Sequence : 



1 MRSEALLLYFTLLHFAGAGF PPKPSFAPLSTSMKPNDACT 1030 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



141681 



Database 



SwissProt 42:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



o . 


Score 


Match 


Length 


DB 


ID 




Description 


1 


5450 


100. 


.0 


1030 


1 


SM6A_ 


HUMAN 


Q9h2e6 


homo sapien 


2 


4346.5 


79. 


, 8 


888 


1 


SM6A_ 


MOUSE 


035464 


mus musculu 


3 


2013.5 


36. 


, 9 


888 


1 


SM6B 


"human 


Q9h3t3 


homo sapien 


4 


1997 


36. 


.6 


887 


1 


SM6B 


"rat 


070141 


rattus norv 


5 


1995 


36. 


,6 


886 


1 


SM6B_ 


MOUSE 


054951 


mus musculu 


6 


1515.5 


27. 


.8 


930 


1 


SM6C 


"human 


Q9h3t2 


homo sapien 


7 


1493.5 


27. 


,4 


931 


1 


SM6C 


"mouse 


, Q9wtm3 


mus musculu 


8 


1491 


27. 


, 4 


960 


1 


SM6C_ 


[rat 


Q9wtl3 


rattus norv 


9 


971.5 


17. 


,8 


730 


1 


SM1A_ 


[SCHAM 


Q26473 


schistocerc 


10 


932.5 


17. 


, 1 


771 


1 


SM1A_ 


[drome 


Q24322 


drosophila 


11 


858 


15. 


,7 


772 


1 


SM3A_ 


"rat 


Q63548 


rattus norv 


12 


853 


15, 


,7 


772 


1 


SM3A_ 


[mouse 


008665 


mus musculu 


13 


852.5 


15. 


.6 


771 


1 


SM3A_ 


_HUMAN 


Q14563 


homo sapien 
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14 . 


1 


1093 


1 


SM5B 


TTT TXjTT\ XT 

HUMAN 


n„ o o o 

Qypzo j 
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MOUSE 
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28 
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14 . 
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[mouse 


Q64151 


mus musculu 


42 
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1 


SM4B^ 


HUMAN 


Q9npr2 


homo sapien 


43 
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12. 
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1 


SM4A_ 


[human 


Q9h3sl 


homo sapien 


44 


664 


12. 
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rattus norv 


45 
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1 
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ALIGNMENTS 



RESULT 1 
SM6A_HUMAN 

ID SM6A_HUMAN STANDARD; PRT; 1030 AA. 

AC Q9H2E6; Q9P2H9; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 6A precursor (Semaphorin VIA) (Sema VIA) (Semaphorin 6A-1) 

DE (SEMA6A-1) . 

GN SEMA6A OR KIAA1368. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., AND INTERACTION WITH EVL . 

RX MEDLINE=20564339; PubMed=10993894 ; 

RA Klostermann A., Lutz B., Gertler F. , Behl C. ; 

RT "The orthologous human and murine semaphorin 6A-1 proteins 

RT (SEMA6A-l/Sema6A-l) bind to the enabled/ vasodilator-stimulated 

RT phosphoprotein-like protein (EVL) via a novel carboxyl-terminal 

RT zyxin-like domain."; 

RL J. Biol. Chem. 275:39647-39653(2000). 

RN [2] 



RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=20181126; PubMed=10718198 ; 

RA Nagase T., Kikuno R. , Ishikawa K.-I., Hirosawa M. , Ohara O.; 

RT "Prediction of the coding sequences of unidentified human genes. XVI. 

RT The complete sequences of 150 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 7:65-73(2000). 

CC -!- FUNCTION: Can act as repulsive axon guidance cues. May play a role 
CC in channeling sympathetic axons into the sympathetic chains and 

CC controlling the temporal sequence of sympathetic target 

CC innervation (By similarity) . 

CC -!- SUBUNIT: Active as a homodimer or oligomer. Interacts with EVL. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=Q9H2E6-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q9H2E6-2; Sequence=VSP_007 113 ; 

CC Note=No experimental confirmation available; 

CC SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb~sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF279656; AAG29378.1; 

DR EMBL; AB037789; BAA92606.1; ALT_INIT. 

DR Genew; HGNC: 10738; SEMA6A. 

DR MIM; 605885; -. 

DR GO; GO: 0030424; C:axon; NAS . 

DR GO; GO: 0016021; C: integral to membrane; NAS. 

DR GO; GO: 0008580; F: cytoskeletal regulator activity; NAS. 

DR GO; GO:0005515; F: protein binding; IPI. 

DR GO; GO: 0006915; P:apoptosis; NAS. 

DR GO; GO: 0007411; P:axon guidance; NAS. 

DR GO; GO: 0007166; Prcell surface receptor linked signal transdu. . .; NAS. 

DR GO; GO: 0007399; P : neurogenesis ; NAS. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

KW Signal; Transmembrane; Multigene family; Neurogenesis; Glycoprotein; 

KW Developmental protein; Alternative splicing. 

FT SIGNAL 1 18 POTENTIAL . 

FT CHAIN 19 1030 SEMAPHORIN 6A. 

FT DOMAIN 19 64 9 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 650 67 0 POTENTIAL. 

FT DOMAIN 671 1030 CYTOPLASMIC (POTENTIAL) . 

FT DOMAIN 56 491 SEMA. 
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FT 


CARBOHYD 
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N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL). 


FT 


CARBOHYD 


461 


461 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL). 


FT 


VARSPLIC 


576 


576 


N -> NDISTPLPDNEMSYNTVY (in isoform 


FT 








/FTId=VSP 


007113. 




SQ 


SEQUENCE 


1030 


AA; 114368 


MW; A57B79C10AEC4B34 


CRC64; 



Query Match 100.0%; Score 5450; DB 1; Length 1030; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1030; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



Qy 

Db 



0; 



MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 



Qy 

Db 

Qy 

Db 



61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I 

61 IMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



181 



181 



241 



241 



301 



301 



361 



361 



421 



421 



481 



KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 24 0 

VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

DVI RINGRDWLATFSTP YNS I PGSAVCAYDMLDIASVFTGRFKEQKS PDSTWTPVPDER 360 

VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

TKIAVXJTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 4 80 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 4 80 



540 



GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCI ASRDP YCGWI KEGG 540 

541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFV 660 
I I II I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



601 



GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVILAFV 660 



Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 72 0 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IN I I I I I I I I I 
Db 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 7 80 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I - 
Db 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQ 840 

Qy 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 MALEDQAATLEYKTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREASLGPPGASLSQTGL 900 

Qy 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 901 SKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPP 960 

Qy 961 P APQRVD S I QVH S SQ P S GQAVT VS RQ P S LNAYN S LT RS GLKRT P S LKP DVP P KP S FAP L S 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 P APQRVD S I QVH S SQ P S GQAVTVS RQ P S LNAYN S LT RS GLKRT PS LKP DVP P KP S FAP L S 1020 

Qy 1021 TSMKPNDACT 1030 

I I I I I I I I I I 
Db 1021 TSMKPNDACT 1030 



RESULT 2 
SM6A MOUSE 



ID 
AC 
DT 
DT 
DT 
DE 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RX 
RA 
RA 
RT 
RT 
RL 
RN 
RP 
RX 
RA 



SM6A_MOUSE 
035464; 
30-MAY-2000 
30-MAY-2000 
15-MAR-2004 



STANDARD ; 



PRT; 



888 AA. 



Craniata; Vertebrata; Euteleostomi ; 
Rodentia; Sciurognathi ; Muridae; Murinae; Mus, 



(Rel. 39, Created) 
(Rel. 39, Last sequence update) 
(Rel. 43, Last annotation update) 
Semaphorin 6A precursor (Semaphorin VIA) (Sema VIA) (Semaphorin 6A-1) 
(SEMA6A-1) (Semaphorin Q) (Sema Q) . 
SEMA6A OR SEMAQ. 
Mus mus cuius (Mouse) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N.A. 

MEDLINE=973484 68; PubMed=92 04478 ; 

Zhou L., White F.A. , Lentz S.I., Wright D. E . , Fisher D.A., 
Snider W.D. ; 

"Cloning and expression of a novel murine semaphorin with structural 
similarity to insect semaphorin I . " ; 
Mol. Cell. Neurosci. 9:26-41(1997). 
[2] 

INTERACTION WITH EVL. 
MEDLINE=20564339; PubMed=10993894 ; 
Klostermann A., Lutz B., Gertler F. , Behl C; 



RT "The orthologous human and murine semaphorin 6A-1 proteins 

RT (SEMA6A-l/Sema6A-l) bind to the enabled/ vasodilator-stimulated 

RT phosphoprotein-like protein (EVL) via a novel carboxyl-terminal 

RT zyxin-like domain."; 

RL J. Biol. Chem. 275:39647-39653(2000). 

CC -!- FUNCTION: Can act as repulsive axon guidance cues. May play a role 
CC in channeling sympathetic axons into the sympathetic chains and 

CC controlling the temporal sequence of sympathetic target 

CC innervation. 

CC -!- SUBUNIT: Active as a homodimer or oligomer. Interacts with EVL. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: Particularly high levels in spinal cord, 
CC cerebellum, metencephalon, superior and inferior colliculus, 

CC diencephalon, olfactory bulb and eye. 

CC -!- DEVELOPMENTAL STAGE: Temporally and spatially regulated during 
CC development. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF030430; AAB86408.1; -. 

DR MGD; MGI : 1203727; Sema6a. 

DR GO; GO: 0030424; C:axon; NAS . 

DR GO; GO: 0016020; C : membrane ; NAS. 

DR GO; GO: 0005515; F:protein binding; IPI. 

DR GO; GO: 0006915; P:apoptosis; NAS. 

DR GO; GO: 0007 411; P : axon guidance; NAS. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

KW Signal; Transmembrane; Multigene family; Neurogenesis; Glycoprotein; 

KW Developmental protein. 
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SEMA. 
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49 


N-LINKED 


( GLCNAC . 
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FT 


CARBOHYD 


65 
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. ) (POTENTIAL) . 


FT 


CARBOHYD 


282 
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N-LINKED 


(GLCNAC. 


.) (POTENTIAL). 


FT 


CARBOHYD 


434 


434 


N-LINKED 


(GLCNAC. 


.) (POTENTIAL). 


FT 


CARBOHYD 


461 


461 


N-LINKED 


(GLCNAC. 


. ) (POTENTIAL) . 


SQ 


SEQUENCE 


888 AA; 


99075 


MW; C7094FC2BB345C57 


CRC64; 



Query Match 



79.8%; Score 4346.5; DB 1; Length 888; 



Best Local Similarity 94.4%; Pred. No. 9.7e-273; 

Matches 816; Conservative 24; Mismatches 23; Indels 1; Gaps 1; 

Qy 1 MRSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MRPAALLLCLTLLHCAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQM 60 

Qy 61 IMIMNGTLYIAARDHIYTVDIDTSHTEEI YCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 61 IMIMNRTLYVAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNF 120 

Qy 121 IKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADG 180 

I I I I I I I I I I I I I I I I I II I I I I I I I : : I I I I I I I I I I I I I I I I I I I I I I I : II I I II 
Db 121 IKVLLKKNDDTLFVCGTNAFNPSCRNYRVDTLETFGDEFSGMARCPYDAKHANIALFADG 180 

Qy 181 KLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

I I I I I I I I I I I I I I I I I I I I I I : I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 181 KLYSATVTDFLAIDAVIYRSLGDSPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIA 240 

Qy 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I II I I I I I 
Db 241 VEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVT 300 

Qy 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I : I II I I I I I I I I I I I I I I I I I I I I 

Db 301 DVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIANVFTGRFKEQKSPDSTWTPVPDER 360 

Qy 361 VPKPRPGCCAGS S SLERYATSNEFPDDTLNFI KTHPLMDEAVPS I FNRPWFLRTMVRYRL 420 

I I I I I I I I I I I I I I I I : I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I II I I I I I 

Db 361 VPKPRPGCCAGS SSLEKYATSNEFPDDTLNFI KTHPLMDEAVPS I INRPWFLRTMVRYRL 420 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I : I I I I I I I I I I I I : I I I I I I I I I 
Db 421 TKIAVDNAAGPYQNHTWFLGSEKGIILKFL7VRIGSSGFLNGSLFLEEMNVYNPEKCSYD 480 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I : : I I 
Db 481 GVEDKRIMGMQLDRASGSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWVRESG 540 

Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

: I : I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I : I I I I I I I I II I : : : I I I I I 
Db 541 SCAHLSPLSRLTFEQDIERGNTDGLGDCHNSFVALNGHASSLYPSTTTSDSASRDGYESR 600 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVi LAFV 660 

I I I I II I I : : I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I II I I I I I I I I I I II 
Db 601 GGMLDWNDLLEAPGSTDPLGAVSSHNHQDKKGVIRESYLKSNDQLVPVTLLAIAVI LAFV 660 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 MGAVFSGIIVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 720 

Qy 721 AILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSREWERN 780 

I I I II I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I : I I I I : I I I I I I I II 
Db 721 AILTPLMHNGKLATPSNTAKMLIKADQHHLDLTALPTPESTPTLQEKRKPNRGSREWERN 7 80 

Qy 781 QNLINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSE-VA 839 

I I : I I I I I I I I I II I I I I I I II I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I II 



Db 



781 QNIINACTKDMPPMGSPVIPTDLPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEWA 840 



Qy 840 QMALEDQAATLEYKTIKEHLSSKS 863 

I I I I II I I I I I I I I I I I I I I I I : I 

Db 841 QMALEDQAATLEYKTIKEHLSSES 8 64 

RESULT 3 
SM6B HUMAN 



ID SM6B_HUMAN STANDARD; PRT; 8 88 AA. 

AC Q9H3T3; Q9NRK9 ; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 6B precursor (Semaphorin Z) (Sema Z). 

GN SEMA6B OR SEMAZ. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE=Brain; 

RA Kimura T . , Ishida H.; 

RL Submitted (JAN-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 2) . 

RX MEDLINE=2124 868 0; PubMed=11350127 ; 

RA Correa R.G., Sasahara R.M. , Bengtson M.H., Katayama M.L.H., 

RA Salim A. CM., Brentani M.M., Sogayar M.C., de Souza S.J., 

RA Simpson A.J.G.; 

RT "Human semaphorin 6b."; 

RL Genomics 73:343-348(2001). 

CC -!- FUNCTION: MAY PLAY A ROLE IN BOTH PERIPHERAL AND CENTRAL NERVOUS 

CC SYSTEM DEVELOPMENT (BY SIMILARITY) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- ALT E RN AT I VE PRODUCTS: 

CC Event-Alternative splicing; Named isoforms=2; 

CC Comment=Additional is o forms seem to exist; 

CC Narne^l ; 

CC IsoId=Q9H3T3-l; Sequence=Displayed; 

CC Name =2 ; Synonyms=6B . 1 ; 

CC IsoId=Q9H3T3-2; Sequence=VSP_006044 , VSP_006045; 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AB022433; RAB2 0669.1; -. 

DR EMBL; AF216389; AAF87661.1; 

DR Genew; HGNC: 10739; SEMA6B. 



UK 


InterPro; 


IPR003659; Plexin- 


like. 




Ut\ 


InterPro; 


IPR001627; Sema. 






DK 


Pfam; PF01403; 


Sema ; 1 








DR 


SMART; SM00423; 


PSI; 1 








DR 


SMART; SM00630; 


Sema; 


1. 






KW 


Signal; Transmembrane; 


Multigene family; Neurogenesis; Glycoprotein; 


KW 


Developmental protein; 


Alternative splicing. 


FT 


SIGNAL 


1 


25 




POTENTIAL 




FT 


CHAIN 


26 


888 




SEMAPHORIN 6B. 


FT 


DOMAIN 


26 


603 




EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


604 


624 




POTENTIAL 




FT 


DOMAIN 


r o c 

62b 


8 8 8 




CYTOPLASMIC (POTENTIAL) . 


FT 


DOMAIN 


238 


547 




SEMA. 




FT 


DOMAIN 


661 


674 




POLY-GLY. 




FT 


DOMAIN 


750 


753 




POLY-LEU. 




FT 


CARBOHYD 


74 


74 




N-LINKED 


(GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


155 


155 




N-LINKED 


(GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


167 


167 




N-LINKED 


(GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


291 


291 




N-LINKED 


(GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


386 


386 




N-LINKED 


(GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


441 


441 




N-LINKED 


(GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


4 62 


462 




N-LINKED 


(GLCNAC. . .) (POTENTIAL). 


FT 


VARSPLIC 


471 


517 




LSVFLEEFETYRPDRCGRPGGGETGQRLLSLELDAASGGLL 


FT 










AAFPRC -> 


RVCQVGHACRVCVHERRSWWPQRPGRWLSRR 


FT 










WGFQKARGPPRCRLGV (in isoform 2). 


FT 










/FTId=VSP 


006044. 


FT 


VARSPLIC 


518 


888 




Missing (in isoform 2) . 


FT 










/FTId=VSP 


006045. 


FT 


CONFLICT 


30 


30 




D -> E (IN REF. 2) . 


SQ 


SEQUENCE 


888 AA; 95270 MW 


; 6FFB44D6828C70CB CRC64 ; 



Query Match 36.9%; Score 2013.5; DB 1; Length 888; 

Best Local Similarity 45.3%; Pred. No. 3.6e-122; 

Matches 424; Conservative 129; Mismatches 241; Indels 141; Gaps 20; 

Qy 5 ALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR — HRLDIQMIM 62 

I I I I II I I I I I I : I : : : I I I I I I I I I I I I : I I : : 

Db 12 ALLLLLLLLGGAHGLFPEDPPPLSVAPRDYLNHYPVFVGSGPGRLTPAEGADDLNIQRVL 71 

Qy 63 IMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIK 122 

: I I I : I | I : : I I : : : : I : : I I I I : I : I : : I I I I I I : I I I I : I 
Db 72 RVNRTLFIGDRDNLYRVELEPPTSTELRYQRKLTWRSNPSDINVCRMKGKQEGECRNFVK 131 

Qy 123 VLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKL 182 



Db 



132 VLLLRDESTLFVCGSNAFNPVCANYSIDTLQPVGDNISGMARCPYDPKHANVALFSDGML 191 



Qy 



183 YSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVE 242 




Db 



192 FTATVTDFLAI DAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSHVYFFFREIAME 251 



Qy 



Db 



243 YNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDV 302 

: I : III I I I : I II I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 
252 FN Y LE KWVS RVARVC KN DVG GS P RVL E KQWT S FL KARLN C S VP GD S H FY FN VLQAVT GV 311 



QY 



303 IRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVP 362 
: : II I I I I I I I I 1111111111:1: :|:M ll|:lllll:l lllll:::!l 



Db 312 VSLGGRPWLAVFSTPSNSIPGSAVCAFDLTQVAAVFEGRFREQKSPESIWTPVPEDQVP 371 

Qy 363 KPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTK 422 

: I I I I I I I : I I : III I I I : I I I I I I I I I I I I : - II | | | : : I : : I I : 

Db 372 RPRPGCCAAPGM— QYNASSALPDDILNFVKTHPLMDEAVPSLGHAPWILRTLMRHQLTR 429 

Qy 423 IAVDTAAGPYQNHTWFLGSEKGIILKFLAR — IGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

: I I I I II : I I I I I I I I I I : I I I I I II I : I I I I I : : I 

Db 430 VAVDVGAGPWGNQTWFLGSEAGTVLKFLVRPNASTSGTSGLSVFLEEFETYRPDRCGRP 489 

Qy 481 GVED— KRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKE 538 

I : : I : : : : I I I I I I I I : : I I : II : : : I I I I I : I I II I I : 
Db 490 GGGETGQRLLS LELDAASGGLLAAFPRCVVRVPVARCQQYS GCMKNC I GSQD P YCGWAPD 549 

Qy 539 GGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYE 598 

I : I I I I : I MM: : I I I I I I 
Db 550 -GSCIFLSPGTRAAFEQDVSGASTSGLGDC 578 

Qy 599 SRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVILA 658 

I : : I I : I I I I I : : I 

Db 57 9 TGLLRASLSEDRAGLVSVNLLVTSSVAA 606 

Qy 659 FVMGAVFS GI TV- YCVCDHRRKDVAWQRKEKE — LTHSRRGSMSSVTKLSGLFGDTQSK 715 

I I : I I I I I : I : I I : : : I : I I : I I II : : I I : : I I : : : : 

Db 607 FWGAWSGFSVGWFVGLRERRELA — RRKDKEAILAHGAGEAVLSVSRL GERRAQ 660 

Qy 716 DP KPEAILTPLMHNGKLATPGNTAK-MLIKADQHHLDLTALPTPEST 761 

I 111:111111 III:: III I I I I I I 

Db 661 GPGGRGGGGGGGAGVP PEAL LAP LMQNG WAKATLLQGGPHDLDSGLLPTPEQT 713 

Qy 762 PTLQQKRKP S RGSREWERNQNLINACTKDMPPMGS PVI PTDLP LRASPSH 811 

I I I I I I : I I I : I I : : I I : I : 

Db 714 P-LPQKRLPTPHPHPHALGPRAWDH GH P L L PAS AS S S L L L LAP AR 757 

Qy 812 IPSVWLPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLV 871 

I I I : I : II: : : : I : : 

Db 758 APEQPPAPGEPTPDGRLYAARPGRASHGDFPLTPHASPDRRRW SAPTGPLDPA 811 

Qy 872 ENLDSLP-PKVPQREASL GPPGASLSQT 898 

I I I I I II I I I : I : I 

Db 812 SAADGLPRPWSPPPTGSLRRPLGPHAPPAATLRRT 84 6 



RESULT 4 
SM6B_RAT 

ID SM6B_RAT STANDARD; PRT; 887 AA. 

AC 070141; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Semaphorin 6B precursor (Semaphorin Z) (Sema Z) . 

GN SEMA6B. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus. 
OX NCBI_TaxID=10116; 
RN [1] 



RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; TISSUE^Brain; 

RX MEDLINE=98087397; PubMed=9427525 ; 

RA Kikuchi K. , Ishida H . , Kimura T . ; 

RT "Molecular cloning of a novel member of sernaphorin family genes, 

RT sernaphorin Z."; 

RL Brain Res. Mol. Brain Res. 51:229-237(1997). 

CC -!- FUNCTION: MAY PLAY A ROLE IN BOTH PERIPHERAL AND CENTRAL NERVOUS 
CC SYSTEM DEVELOPMENT. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- DEVELOPMENTAL STAGE: DETECTED IN THE FIRST BRANCHIAL ARCH OF 

CC EMBRYONIC DAY 11 (Ell) EMBRYO, AND SUBSEQUENTLY IN THE MYOTOMES 

CC AND THE DORSAL ROOT GANGLIA IN DEVELOPING SOMITES FROM Ell. 5 

CC THROUGH El 3. 5, BUT NOT IN THE BRAIN. HOWEVER, AT El 5, 18, 21 AND 

CC PO, SEMAZ WAS HIGHLY EXPRESSED IN THE BRAIN. 

CC -!- SIMILARITY: Belongs to the sernaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AB000776; BAA25687.1; -. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

KW Signal; Transmembrane; Multigene family; Neurogenesis; Glycoprotein; 

KW Developmental protein. 



FT 


SIGNAL 


1 


26 


POTENTIAL 






FT 


CHAIN 


27 


887 


SEMAPHORIN 6B. 




FT 


DOMAIN 


27 


605 


EXTRACELLULAR (POTENTIAL). 


FT 


TRANSMEM 


606 


626 


POTENTIAL 






FT 


DOMAIN 


627 


887 


CYTOPLASMIC (POTENTIAL) 




FT 


DOMAIN 


239 


549 


SEMA. 






FT 


CARBOHYD 


75 


75 


N-LINKED 


(GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


156 


156 


N-LINKED 


(GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


168 


168 


N-LINKED 


(GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


292 


292 


N-LINKED 


(GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


387 


387 


N-LINKED 


(GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


442 


442 


N-LINKED 


(GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


463 


463 


N-LINKED 


(GLCNAC. . .) 


(POTENTIAL) 


SQ 


SEQUENCE 


887 AA; 


95752 


MW; 09543F3F202CD301 CRC64; 



Query Match 36.6%; Score 1997; DB 1; Length 887; 

Best Local Similarity 42.7%; Pred. No. 4.2e-121; 

Matches 433; Conservative 132; Mismatches 260; Indels 190; Gaps 25; 

Qy 2 RSEALLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR — HRLDIQ 59 

I II II II- I : I : : : I I I I I I I I I I I I : I I 

Db 10 RPALLFLLLLLLRVTHGLFPDEPPPLSVAPRDYLSHYPVFVGSGPGRLTPAEGAEDLNIQ 69 



Qy 60 MIMIMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHN 119 

: : : I I I : I I I : : I I : : : I : I : : I I I I : I : I : I I I I I I I : I I I 
Db 70 RVLRVNRTLFIGDRDNLYQVELEPSTSTELRYQRKLTWRSNPSDIDVCRMKGKQEGECRN 129 

Qy 120 FIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFAD 179 

1:1111 : : : II I I I : I I I I I I II I I II : II I I I I I I I I I ! I I I I I I I : I 
Db 130 FVKVLLLRDESTLFVCGSNAFNPICANYSMDTLQLLGDNISGMARCPYDPKHANVALFSD 189 

Qy 180 GKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSPWLKEPYFVQAVDYGDYIYFFFREI 239 

I I : : I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : : I : : I I I I I I I 
Db 190 GMLFTATVTDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSHVYFFFREI 24 9 

Qy 240 AVE YNTMG KWF P RVAQVC KN DMGGS QRVL E KQWT S FL KARLN C S VP G D S H F Y FN I LQAV 299 

I : I : I : III I I I : I I I I I : I II I I I I I I I I I I I I I I I I I I II I I I I I I I I : I I I I 
Db 250 AMEFNYLEKVWSRVARVCKNDVGGSPRVLEKQWTSFLKARLNCSVPGDSHFYFNVLQAV 309 

Qy 300 TDVIRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDE 359 

I I : : II I : I I I I I I I I I I I I I I I I : I I : I : I I I I I : I I I I I : I I I I I I : : 
Db 310 TGWSLGGRPVILAVFSTPSNSIPGSAVCAFDMNQVAAVFEGRFREQKSPESIWTPVPED 369 

Qy 360 RVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYR 419 

: I I : I I I I I I I : I II I I : I I I : I I I II I I I I II I : : I I : I I : : I : : 

Db 370 QVPRPRPGCCAAPGM — QYNASNALPDEI LNFVKTHPLMDEAVPSLGHS PWI VRTLI RHQ 427 

Qy 420 LTKIAVDTAAGP YQNHTWFLGSEKGI I LKFLAR — IGNSGFLNDSLFLEEMSVYNSEKC 477 

I I : : I I I I I I : I I : I I I I I I I : I I I I : II I : I I I I I : : I 

Db 42 8 LTRVAVDVGAGPWGNQTIVFLGSEVGTVLKFLVKPNASVSGTTGPSIFLEEFETYRPDRC 4 87 

Qy 478 SYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCG 534 

I I : | : : : : | | | | I I I I I : : I I : I I : : I I I I I : I I I I I 

Db 488 GRSSSAGEWGQRLLSLELDAASGGLLAAFPRCWRVPVARCQLYSGCMKNCIGSQDPYCG 547 

Qy 535 WIKEGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQ 594 

I : I : I II: Mill: : I I I I I I 

Db 548 WAPD-GSCIFLRPGTSATFEQDVSGASTSGLGDC 580 

Qy 595 EGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIA 654 

I :: I I I I I I I : 

Db 581 TGLLRASLSDDRAGLVSVNLLVTS 604 

Qy 655 VI LAFVMGAVFSGITV-YCVCDHRRKDVAWQRKEKE — LTHSRRGSMSSVTKL 705 

: I I I : I I I I I : I : I |:::| :||:|| II :: 

Db 605 SVAAFWGAWSGFSVGWFVGLRERRELA — RRKDKEAI LAHGGS EAVLS VS RLGERRGT 662 

Qy 706 — SGLFGDTQSKDPKPEAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPT 7 63 

II I I I : I I I I I I I I : III I I I I I I I 

Db 663 GT GGRGGAGGGPGGP PEAL LAP LMQNGW TKAALLHGGPHDLDSGLLPTPEQTP- 715 

Qy 764 LQQKRKPSR GSREWERNQNLINACTK DMPPMGSPVIPTD — 802 

llllh | | | : : | : : | : I I : I I : 

Db 716 LPQKRLPTTHPHAHALGPRAWDHSHALLSASASTSLLLLAHTRAPEQPP VPTESG 77 0 

Qy 803 LPLRASPSHIPSVWLPITQQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIK 856 

I II : I ||::: | : : 
Db 771 PESRLCAPRSCRASHPGDFPLTP HAS PDRRRWSAPTGPLDS S SVG 816 

Qy 857 EHLSSKSPNHGVNLVENLDSLP-PKVPQREASL GPPGASLSQTGLSKRLEMHHS 909 



-II III 1*1 • I I • 

Db 817 DDLPGPWSPPATSSLRRPGPHGPPTAALRRT HT 84 9 

Qy 910 SSYGVDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGD-NPPPAP 963 

: I : I II ::|:|| I I : III 

Db 850 FNSG EARPGGHRPRRHA PADSTHL LPCGTGERTAPPVP 887 



RESULT 5 
SM6B_MOUSE 

ID SM6BJVIOUSE STANDARD; PRT; 886 AA. 

AC 054951; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Semaphorin 6B precursor (Semaphorin VIB) (Sema VIB) (Semaphorin N) 

DE (Sema N) . 

GN SEMA6B OR SEMAN. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=1009 0; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98027184; PubMed=9361278 ; 

RA Eckhardt F., Behar O., Calautti E., Yonezawa K., Nishimoto I., 

RA Fishman M. C. ; 

RT "A novel transmembrane semaphorin can bind c-src"; 

RL Mol. Cell. Neurosci. 9:409-419(1997). 

CC -!- SUBUNIT: HOMODIMER. BINDS SPECIFICALLY THE SH3 DOMAIN OF THE 
CC PROTOONCOGENE C-SRC. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: During development it is expressed in 

CC subregions of the nervous system and is particularly prominent in 

CC muscle. In adulthood, it is expressed ubiquitously. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF036585; AAC00493.1; -. 

DR MGD; MGI : 1202889; Sema6b. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

KW Signal; Transmembrane; Multigene family; Neurogenesis; Glycoprotein; 

KW Developmental protein. 

FT SIGNAL 1 26 POTENTIAL. 

FT CHAIN 27 886 SEMAPHORIN 6B. 
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N-LINKED ( GLCNAC ♦ . . ) 


(POTENTIAL) 


FT 


CARrSOniD 


loo 


ICC 

iDO 


N-LINKED (GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


292 


292 


N-LINKED (GLCNAC. . . ) 


(POTENTIAL) 


FT 


CARBOHYD 


387 


387 


N-LINKED (GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


442 


442 


N-LINKED (GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


.463 


463 


N-LINKED (GLCNAC. . .) 


(POTENTIAL) 


SQ 


SEQUENCE 


886 AA; 


95466 


MW; E5F56D125CDA574D CRC64; . 



Query Match 36.6%; Score 1995; DB 1; Length 886; 

Best Local Similarity 43.7%; Pred. No. 5.6e-121; 

Matches 438; Conservative 132; Mismatches 257; Indels 176; 



Gaps 28; 



Qy 

Db 

QY 
Db 

Qy 

Db 



13 



L Y FT LLH FAGAG FPEDSEPI SISHGNYTKQYPVFVGHKPGRNTTQR — HRLDIQMIM 62 

I I I I I III: I : I :: : I I I I I I I I II I MM : : 

LSFFLLLLLGVTYGLFPEEPPPLSVAPRDYLSHYPVFVGSGPGRLTAAEGAEDLNIQRVL 72 



63 IMNGTLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIK 122 

: I I I : I | | : : | | : : : | : | : : I I I I : I : I : I I I I I I I : I I I I : I 
73 RVNRTLFIGDRDNLYQVELEPSTSTELRYQRKLTWRSNPSDIDVCRMKGKQEGECRNFVK 132 



123 



182 



VLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKL 
III ::: I I I I I : I I I I I I II MM: II I I I I I I I I I I I I I II I I : I I I 
133 VLLLRDESTLFVCGSNAFNPICANYSMDTLQLLGDSISGMARCPYDPKHANVALFSDGML 192 



Qy 



Db 



183 YSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVE 242 

:: I I I M I I I I I I I I I I I I I : I I I I I I I I I II I I II I I I I I : : I : : I I I I I I I I : I 
193 FTATVTDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSHVYFFFREIAME 252 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



243 YNTMG KWF P RVAQ VC KN DMGG S QRVL E KQWT S F L KARLN C S VP GD S H F Y FN I LQAVT D V 302 

: I : III I I I : I I I I I : I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I : I II I I I 
253 FN YL EKVWS RVARVC KNDVGG S P RVL E KQWT S FLKARLN C S VP GD S H F Y FNVLQAVT GV 312 



303 



362 



IRINGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVP 

: : II I : I I I I I I I I I II II I I I : I I : I : I I I I I : I I I I I : I II I I I : : : II 
313 VS LGGRPVI LAVFST P SN S I PGSAVCAFDMNQVAAVFEGRFREQKS PES I WT PVPEDQVP 372 

363 KPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTK 422 

: I I I I I I I : I I : I I : I I I : I I I I I I I I I I I I : : II : I I : : I : : I I : 

373 RPRPGCCAAPGM— QYNASSALPDEILNFVKTHPLMDEAVPSLGHSPWIVRTLMRHQLTR 430 

423 IAVDTAAGPYQNHTWFLGSEKGIILKFLAR — IGNSGFLNDSLFLEEMSVYNSEKC 477 

: I I I I I I : I I : I I I I I I I : I I I I : II I : I I I I I : : I 

431 VAVDVGAGPWGNQTIVFLGSEAGTVLKFLVKPNASVSGTTGPSIFLEEFETYRPDRCGRP 490 

478 SYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIK 537 

I I : I : : : : | | | I I II II : : I I : I I : : I I I I I : I I I I I I 
491 SSAGEWGQRLLSLELDAASGGLLAAFPRCWRVPVARCQLYSGCMKNCIGSQDPYCGWAP 550 

538 EGGACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGY 597 

: I : I II: Mill: : I I I I I I 

551 D-GSCIFLRPGTSATFEQDVSGASTSGLGDC 580 



Qy 598 ESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVIL 657 

I : : I I I I I I I : : 

Db 581 TGLLRASLSDDRAGLVSVNLLVTSSVA 607 

Qy 658 AFVMGAVFSGITV-YCVCDHRRKDVAWQRKEKE — LTHSRRGSMSSVTKLSGLFGDTQS 714 

I I I : I I I I I : I : I I : : : I : I I : I I II : : I I : : I I : : ' 

Db 608 AFWGAWS GFS VGWFVGLRERRELA — RRKDKEAILAHGGSEAVLSVSRL GERRG 661 

Qy 715 KDP KPEAI LTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTP 762 

I 111:111111 I I : III I I I I I I I 

Db 662 TGPGGRGGAGGGPGGP PEAL LAP LMQNGW TKAALLHGGPHDLDTGLLPTPEQTP 715 

Qy 763 TLQQKRKP SRGSREWERNQNLINACTKDMPPMGS PVI PTDL PLRASPSH 811 

I I I I I : | | | | : : | : : | II I I I I 

Db 716 -LPQKRLPTPHPHAHALGSRAWDHSHALLSASAS TSLLLLAPARASEQ- 762 

Qy 812 I PSVWLPITQQGYQHEYVDQP KMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGV 868 

III: : | : | I I : : : I : I 

Db 7 63 -PQVPAEPGPE SRLCAPRSCRASHPGDFPLTPHASPDRRRWSAPTGPLDPSVG- 815 

Qy 8 69 NLVENLDSLP-PKVPQREASL GPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYP 921 

I I I I I : I I I I I I : I : I I : : I : I 

Db 816 DGLPGPWS PPATSSLRRPGPHGPPTAALRRT HTFNSG EARP 856 

Qy 922 TNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGD-NPPPAP 963 

II : : I : I I I I : III 

Db 857 GGHRPRRHP PADSTHL LPCGTGERTAPPVP 8 86 



RESULT 6 
SM6C_HUMAN 

ID SM6C_HUMAN STANDARD; PRT; 930 AA. 

AC Q9H3T2; Q8WXT8; Q8WXT9; Q8WXU0; Q96JF8; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 6C precursor (Semaphorin Y) (Sema Y) . 

GN SEMA6C OR SEMAY OR KIAA1869. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RC TISSUE=Brain; 

RA Kimura T., Ishida H. ; 

RL Submitted (JAN-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 

RC TISSUE=Brain; 

RA Qu X., Zhai Y., Wei H., Zhang C, Xing G., Yu Y., Wu S., Zhang Y., 

RA Ouyang S., Zhou G., He F.; 

RL Submitted (JAN-2001) to the EMBL/GenBank/ DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RC TISSUE=Brain; 

RX MEDLINE=21245130; PubMed=l 1347 906; 



RA Nagase T., Nakayama M. , Nakajima D., Kikuno R. , Ohara O. ; 

RT "Prediction of the coding sequences of unidentified human genes. XX. 

RT The complete sequences of 100 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 8:85-95(2001). 

CC -!- FUNCTION: May be a stop signal for the dorsal root ganglion 

CC neurons in their target areas, and possibly also for other 

CC neurons. May also be involved in the maintenance and remodeling of 

CC neuronal connections (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC - ! - ALTERNATIVE PRODUCTS : 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=l; Synonyms=Short 1; 

CC IsoId=Q9H3T2-l; Sequence=Displayed; 

CC Name=2; Synonyms=Short 2; 

CC IsoId-Q9H3T2-2; Sequence=VSP_006046, VSP_006047; 

CC Name=3; Synonyms=Long; 

CC IsoId=Q9H3T2-3; Sequence=VSP_006047; 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb~sib . ch) . 

CC 

DR EMBL; AB022434; BAB20670.1; -. 

DR EMBL; AF339152; AAL72098.1; -. 

DR EMBL; AF339153; AAL72099.1; -. 

DR EMBL; AF339154; AAL72100.1; -. 

DR EMBL; AB058772; BAB47498. 1; ALT_INIT. 

DR Genew; HGNC: 1074 0; SEMA6C. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

KW Signal; Transmembrane; Multigene family; Neurogenesis; Glycoprotein; 

KW Developmental protein; Alternative splicing. 
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POTENTIAL . 


FT 


CHAIN 


25 


930 


SEMAPHORIN 6C. 


FT 


DOMAIN 


25 


604 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


605 


625 


POTENTIAL. 


FT 


DOMAIN 


626 


930 


CYTOPLASMIC (POTENTIAL) . 


FT 


DOMAIN 


233 


540 


SEMA. 


FT 


DOMAIN 


662 


667 


POLY-PRO. 


FT 


DOMAIN 


752 


755 


POLY-PRO. 


FT 


CARBOHYD 


70 


70 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


286 


286 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


437 


437 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


VARSPLIC 


184 


223 


Missing (in isoform 2) . 


FT 








/FTId=VSP_006046. 


FT 


VARSPLIC 


586 


586 


Y -> YVLPGPGPSPGTPSPPSDAHPRPQSSTLGVHTR 


FT 








(in isoform 2 and isoform 3) . 



FT /FTId=VSP_006047. 

FT CONFLICT 125 125 I -> V (IN REF. 2; AAL72099) . 

FT CONFLICT 252 252 R -> K (IN REF. 1) . 

FT CONFLICT 455 455 P -> T (IN REF. 1 AND 3) . 

SQ SEQUENCE 930 AA; 99682 MW; 8AF8814ADBC84C8 8 CRC64; 

Query Match 27.8%; Score 1515.5; DB 1; Length 930; 

Best Local Similarity 35.8%; Pred. No. 5.3e-90; 

Matches 380; Conservative 136; Mismatches 356; Indels 189; Gaps 32; 

Qy 6 LLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMN 65 

I I I : I I | | | : I I : I I | I I : : III : : I 

Db 13 LLLLLSLPH-TQAAFPQDPLPLLISDLQGTSPLSWFRGLEDDAVAAEL-GLDFQRFLTLN 7 0 

Qy 66 GTLYI AARDH I YTVDI DT S HT- EEI YCS KKLTWKSRQADVDTCRMKGKHKDECHNFI KVL 124 

I I : I I I I I : : : I : I : : I I I I : I : II : I : : I I I I I : I : I : I I 

Db 71 RTLLVAARDHVFSFDLQAEEEGEGLVPNKYLTWRSQ — DVENCAVRGKLTDECYNYI RVL 12 8 

Qy 125 LKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYS 184 

: : I I I I I : I : I I I : I : : I : I : I I I I I I I : II : I I I : I I : I I I I 
Db 129 VPWDSQTLLACGTNSFSPVCRSYGITSLQQEGEELSGQARCPFDATQSNVAIFAEGSLYS 188 

Qy 185 ATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYN 244 

II II I I I I: I I I I I I II: I : I I I I I : II : I II I : : : I I : : I I II I I : : I I 

Db 189 ATAADFQASDAWYRSLGPQPPLRSAKYDSKWLREPHFVQALEHGDHVYFFFREVSVEDA 248 

Qy 245 TMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIR 304 

: I : I I 111:111 Mill I I : : I II I I I I I I I I I I I I I I I I : : I I I : I : 
Db 249 RLGRVQFSRVARVCKRDMGGSPRALDRHWTSFLKLRLNCSVPGDSTFYFDVLQALTGPVN 308 

Qy 305 INGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKP 364 

:: I I : I : I I I I I I I I I I I : : : I I I : I I I I : I I I I I I :: I I I I 
Db 309 LHGRSALFGVFTTQTNSIPGSAVCAFYLDEIERGFEGKFKEQRSLDGAWTPVSEDRVPSP 368 

Qy 365 RPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIA 424 

I I I I I I : : : I : I I I I I I I I I I : I I I I : : : I I I I I : : I 

Db 369 RPGSCAGVGGAALFSSSRDLPDDVLTFIKAHPLLDPAVP PVTHQP-LLTLTSRALLTQVA 427 

Qy 425 VDTAAGP YQNHT WFLGS EKGI I LKFLARI GNS GFLNDS LFLEEMS VYNS EKC S YDGVED 4 84 

II I I I : I I I : I I I I I : I I I I I I : : I I I : I : : I I 

Db 428 VDGMAGPHSNITVMFLGSNDGTVLKVLPPGGRSGG-PEPILLEEIDAYSPARCSGKRTAQ 486 

Qy 485 — KRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGGAC 542 

: I I : I : : I I I : I I I I I : : : I I II I II I : : : I : I I : I I I I I I I I 

Db 487 TARRIIGLELDTEGHRLFVAFSGCIVYLPLSRCARHGACQRSCLASQDPYCGWHSSRG-C 545 

Qy 543 SHLSPNSRLTFEQDIERGNTDGL — GDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 600 

: : : I I I : : I I I : : I : I 
Db 54 6 VDIRGSGGTDVDQ AGNQESMEHGDCQDG ATGSQSGPGDSAY 58 6 

Qy 601 GGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLKGHDQLVPVTLLAIAVI LAFV 660 

III: : I I : I I : I II 

Db 587 GVRRDLP PASAS RS VP I PLLLAS VAAAFA 615 

Qy 661 MGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRG SMSSVTKLSGL 708 

ill II: I I I II I I I I | : | : : | | 

Db 616 LGASVSGLLVSCAC — RR AHRRRGKDIETPGLPRPLSLRSLARLHG- 659 



Qy 709 FGDTQSKDPKP — EAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQ 766 

I : I : I : I I : : I | : I I I I I I I I I I 

Db 660 -GGPEPPPPSKDGDAVQTPQLYTTFLPPPEGVPPP ELACLPTPESTPELPV 7 09 

Qy 767 KRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLPLRASPSH IPSWVLPIT 821 

I : I I I I I I I I : I I : I I hi I 

Db 710 KHLRAAGD-PWEWNQNRNNA KEGPGRSRGGHAAGGPAPRVLVRP-- 752 

Qy 822 QQGYQHEYVDQPKMSEVAQMALEDQAATLEYKTIKEHL SSKSPNHGVNLVENLDS- 876 

I I : | | :: | | : I I II 

Db 753 PPPGCPGQ AVEVTTLEELLRYLHGPQPPRKGAEPPAPLTSR 793 

Qy 877 -LPPKVPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQATTLK 935 

III: I I I I : lh: I II 
Db 7 94 ALPPE — PAPALLGGPSPRPHECASPLRLDV PPEGRCASAPA 833 

Qy 936 RNNTNSSNSSHL S RN Q S FGRGDN P P PAP Q RVDS I QVH S S Q P S GQAVT VS RQ P S 988 

I : : I II II II III II 

Db 834 -RPALSAPAPRLGVGGGRRLPFSGHRAPPALLTRV PSGGPSRYSGGPGKHL 883 

Qy 989 — LNAYNSLTRSGLKRTPSLKPDVPPKPSF-APLSTSMKPN 1026 

I III II : II II II 

Db 884 L YLGRPEGYRGRALKRVDVEKPQLS LKP PLVGPS S RQAVPN 924 



RESULT 7 
SM6C_MOUSE 

ID SM6C_MOUSE STANDARD; PRT; 931 AA. 

AC Q9WTM3; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Semaphorin 6C precursor (Semaphorin Y) (Sema Y) . 

GN SEMA6C OR SEMAY. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 
RC TISSUE-Embryo; 

RX MEDLINE=99160821; PubMed=10049528 ; 

RA Kikuchi K. , Chedotal A., Hanafusa H., Ujimasa Y., de Castro F. , 
RA Goodman C . S . , Kimura T . ; 

RT "Cloning and characterization of a novel class VI semaphorin, 
RT semaphorin Y."; 

RL Mol. Cell. Neurosci. 13:9-23(1999). 

CC -!- FUNCTION: May be a stop signal for the dorsal root ganglion 

CC neurons in their target areas, and possibly also for other 

CC neurons. May also be involved in the maintenance and remodeling of 

CC neuronal connections (By similarity) . 

SUBCELLULAR LOCATION: Type I membrane protein. 
SIMILARITY: Belongs to the semaphorin family. 
SIMILARITY: Contains 1 Sema domain. 



CC 
CC 
CC 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 



CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AB013729; BAA76294.1; -. 

DR MGD; MGI: 1338032; Sema6c. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00630; Sema; 1. 

KW Signal; Transmembrane; Multigene family; Neurogenesis; Glycoprotein; 

KW Developmental protein. 

FT SIGNAL 1 25 POTENTIAL. 

FT CHAIN 26 931 SEMAPHORIN 6C. 

FT DOMAIN 26 605 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 606 626 POTENTIAL. 

FT DOMAIN 627 931 CYTOPLASMIC (POTENTIAL). 

FT DOMAIN 234 541 SEMA. 

FT DOMAIN 663 669 POLY-PRO. 

FT DOMAIN 754 757 POLY- PRO. 

FT CARBOHYD 71 71 N-LINKED ( GLCNAC . . .) (POTENTIAL). 

FT CARBOHYD 287 287 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 438 438 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 931 AA; 99537 MW; B0D9 9D5942 09F125 CRC64; 

Query Match 27.4%; Score 1493.5; DB 1; Length 931; 

Best Local Similarity 35.7%; Pred. No. 1.4e-88; 

Matches 374; Conservative 14 5; Mismatches 353; Indels 177; Gaps 32; 

LLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMN 65 
I I I : I I I I I : I I : I I I : : III:: 



I I : I I I I I : : : I : I : : I I I I : I : I : : I : : I I I I I : I : I : I I 



QY 


6 


Db 


13 


Qy 


66 


Db 


72 


Qy 


125 


Db 


130 


Qy 


185 


Db 


190 


Qy 


245 


Db 


250 


Qy 


305 


Db 


310 


Qy 


365 



I I I I I : I : I M : I : : I : I : I I I I I I I : I I : I I : I I : I I I I 



II II I I I I : I I I I I I II: I : I I I I I : I I : I I I : : : I : : : I I I I I I : : I I 



|:| I 111:111 I I I I I I I : : I I I I I I I I I I II I I I I I I I : : I I : : I 



II : I : I I I I I I I I I I I : : I I I I : I I I I : I I I I I I : :: I I 



Ml Ml : :::| : III I III llhl III ::l I I ll-l 



Db 



370 RPGSCAGVGAAASFSSSQDLPDDVLLFIKAHPLLDPAVPPATHQP-LLTLTSRALLTQVA 428 



Qy 425 VDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVED 484 

II I I I : : I I I : I I II I : I I I I : : : I I I : I : : I I I 

Db 42 9 VDGMAGPHRNTTVLFLGSNDGTVLKVLPP-GGQSLGSEPIVLEEIDAYSH7VRCS--GKRS 4 85 

Qy 485 KRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 540 

: I I : I : : I I hill I : : : I I I I I I I : : : I : I I I I I I I I : I 

Db 486 PRAARRIIGLELDTEGHRLFVAFPGCIVYLSLSRCARHGACQRSCLASLDPYCGWHRSRG 545 



Qy 541 ACSHLSPNSRLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYE-S 599 

II III I I I : I I I : I I 

Db 54 6 CMSIRGP GGTD VDLTGNQES TEHGDCQDGATGS 578 

Qy 600 RGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTLLAIAVILAF 659 

: I I : III: : : I : I I I I I 

Db 579 QSGPGDSAY GVRRDLS PASASRSI PI PLLLACVAAAF 615 

Qy 660 VMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKP 719 

: I I I I : I I I I I : : I : I : : h Ml II 

Db 616 ALGAS VS G L LVS CAC — RRAN RRRSKDIETPGLPRPLSLRSLARLHGGGPEPPPPP 669 



Qy 720 EAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQQKRKPSRGSRE 776 

: I I I : : I : : I I I I I I : I I I I : I 

Db 670 KDGDAAQT PQL YTT FLP P P DGGS P P ELACLPTPETTPELPVKHLRASGG-P 719 

Qy 777 WERNQNLINACT-KDMPPMGSPVIPTDLP LRASPSHIPSVWLPITQQ GYQH — 827 

I I I I I I I III I : I I I I I : II 

Db 720 WEWNQNGNNASEGPGRPPRGCSGAGGPAPRVLVRPPPPGCPGQAVEVTTLEELLRYLHGP 779 

Qy 828 EYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSPNHGVNLVENL 874 

: : I I I I I I I I I : I 

Db 780 QPPRKGSEPLASAPFTSRPPASEPGASLFVD SSPMPRDGVPPL-RL 824 

Qy 875 DSLPPK VPQREASLGPPGASLSQTGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQ 930 

I : I I : I : I I I I I : I I : I I I : 

Db 825 D-VPPEGKRAAPSGRPALSAPAPRLG-VGGSRRL PFPT HR 8 62 



Qy 931 ATTLKRNNTNSSNSSHLSRNQSFGRGDNPPPAPQRVDS-- IQVHSSQPSGQAVTVSRQPS 988 

I I I I I I : | | : : | | 

Db 8 63 A PPGLLTRVPSGGPARYSGGPGRHLLYLGR-PE 8 94 



Qy 989 LNAYNSLTRSGLKRTPSLKPDV — PPKPS 1015 

: III : I III: I I : I : 
Db 895 GH RGRS LKRVDVKS P L S P KP P LAS P PQ P A 923 



RESULT 8 
SM6C_RAT 

ID SM6C_RAT STANDARD; PRT; 960 AA. 

AC Q9WTL3; Q9WTM6; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Semaphorin 6C precursor (Semaphorin Y) (Sema Y) . 

GN SEMA6C OR SEMAY. 

OS Rattus norvegicus (Rat) . 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus . 

OX NCBI_TaxI D=l 0 116; 

RN [1] 

RP SEQUENCE FROM N.A. {ISOFORMS SEMA Y-L AND SEMA Y-S) . 

RC STRAIN=Sprague-Dawley; TISSUE=Muscle; 

RX MEDLINE=99160821; PubMed-10049528 ; 

RA Kikuchi K. f Chedotal A. , Hanafusa H., Ujimasa Y. , de Castro F. , 

RA Goodman C.S., Kimura T.; 

RT "Cloning and characterization of a novel class VI semaphorin, 

RT semaphorin Y."; 

RL Mol. Cell. Neurosci. 13:9-23(1999). 

CC -!- FUNCTION: SHOWS GROWTH CONE COLLAPSING ACTIVITY ON DORSAL ROOT 
CC GANGLION (DRG) NEURONS IN VITRO. MAY BE A STOP SIGNAL FOR THE DRG 

CC NEURONS IN THEIR TARGET AREAS, AND POSSIBLY ALSO FOR OTHER 

CC NEURONS. MAY ALSO BE INVOLVED IN THE MAINTENANCE AND REMODELING OF 

CC NEURONAL CONNECTIONS. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=Sema Y-L; 

CC IsoId=Q9WTL3-l; Sequence=Displayed; 

CC Name=Sema Y-S; 

CC IsoId=Q9WTL3-2; Sequence=VSP_006048 ; 

CC -!- TISSUE SPECIFICITY: Expressed in many regions of the developing 
CC nervous system, probably in neurons and their precursors, but also 

CC in nonneural tissue such as immature muscle and dermis. In adult, 

CC strong expression in the skeletal muscle and moderate expression 

CC in the brain, where cerebellum shows the highest expression. Also 

CC expressed in almost all areas of the CNS . 

CC -!- DEVELOPMENTAL STAGE: DETECTED AT E12 AND FOUND AT MARKEDLY 

CC INCREASED LEVELS AT E15 AND E18 IN BOTH THE HEAD AND THE BODY. AT 

CC BIRTH THE LEVEL DECREASES SIGNIFICANTLY. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; AB000817; BAA76293.2; -. 

DR EMBL; AB014074; BAA76295.1; -. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00630; Sema; 1. 

KW Signal; Transmembrane; Multigene family; Neurogenesis; Glycoprotein; 

KW Developmental protein; Alternative splicing. 

FT SIGNAL 1 23 POTENTIAL . 

FT CHAIN 24 960 SEMAPHORIN 6C. 

FT DOMAIN 24 635 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 636 656 POTENTIAL . 

FT DOMAIN 657 960 CYTOPLASMIC (POTENTIAL) . 

FT DOMAIN 234 541 SEMA. 



FT nnMZlTM 




699 

\J Z? _7 


POT V— P DM 




FT nnMZiTM 


7 ft "3 


7 ft 6 


pot V-Pun 




FT CARBOHYD 


69 


69 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT CARBOHYD 


285 


285 


N- LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT fARROHYD 


436 


436 


M_T TNKF*n fHT PMAf 


\ fPOTFMTTAT ^ 


FT VARSPLIC 


586 




ixij-ssmy I in lsoiorm 


Oclud I o ) . 


FT 






/ r llu-vor UUOU^io. 




SQ SEQUENCE 


960 AA; 


102610 


MW; C8 8293C5607E608 6 


CRC64; 


Query Match 




27.4% 


; Score 14 91; DB 1; 


Length 960; 


Best Local Similarity- 


35.1% 


Pred. No. 2.1e-88; 




Matches 374; 


Conservative 


14 9; Mismatches 363; 


Indels 180; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



Gaps 32; 



6 LLLYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMN 65 

III II I I : I I : I I II: : I I I : : I 

11 LLLLLLSLPQAQTAFPQDPIPLLTSDLQGTSPSSWFRGLEDDAVAAEL-GLDFQRFLTLN 69 



66 



70 



125 



GTLYI AARDHI YTVDI DT SHT- EEI YCS KKLTWKS RQADVDTCRMKGKHKDECHNFI KVL 
I I : I I I I I : : : I : I : : I I I I : I : I : : I : : I I I I I : I : I : I I 

RTLLVAARDHVFSFDLQAQEEGEGLVPNKFLTWRSQ— DMENCAVRGKLTDECYNYIRVL 



LKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGDEFSGMARCPYDAKHANVALFADGKLYS 
: : I I I I I : I : I I I : I : : I : I : I I I I I I I : I I : I I : I : I I I I 
128 VPWDSQTLLACGTNSFSPVCRSYGITSLQQEGEELSGQARCPFDATQSTVAISAEGSLYS 



124 



127 



184 



187 



244 



18 5 ATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVEYN 

II II I llhlllll I II: I : I I I I I : I I : I I I : : : I I : : I I I I I I : : I I 
18 8 ATAADFQASDAWYRSLGPQPPLRSAKYDSKWLREPHFVYALEHGDHVYFFFREVSVEDA 247 

245 TMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDVIR 304 

: I : I I 111:111 I I I I I I I : : II I I I I I I I I I I I I I I I I I : : I I : : I : 
248 RLGRVQFSRVARVCKRDMGGSPRALDRHWTSFLKLRLNCSVPGDSTFYFDVLQSLTGPVN 307 

305 INGRDWLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDERVPKP 364 

: : I I : I : I I I I I I I I I I I : : I I I I : I II I : I I I I I I : : : I I I 

308 LHGRSALFGVFTTQTNSIPGSAVCAFYLDDIERGFEGKFKEQRSLDGAWTPVSEDKVPSP 367 

365 RPGCCAGS S SLERYATSNEFPDDTLNFI KTHPLMDEAVPS I FNRPWFLRTMVRYRLTKI A 424 

IM III : ::: I : I I I I I I I I I I : I I I I ::| I I ll::| 

368 RPGSCAGVGAAALFSSSQDLPDDVLLFIKAHPLLDPAVPPATHQP-LLTLTSRALLTQVA 426 



425 VDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVED 

II I I I :: I I I : I I I I I Ml I I : : I I I : I : Ml I 

427 VDGMAGPHRNTTVLFLGSNDGTVLKVLPP-GGQSLGPEPIILEEIDAYSHARCS — GKRS 



485 



484 



541 



544 



589 



599 



KRIMGMQLDRAS SSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWI KEGG 

: I I M : M I I M I I | : : : | | I M I I : : M : I I I I I M I : I 

PRAARRIIGLELDTEGHRLFVAFPGCIVYLSLSRCARHGACQRSCLASLDPYCGWHRFRG 



484 



483 



540 



543 



ACSHLSPNSRLTFEQDIE-RGNTDGL — GDCHNSFVALN GHSSSLL PSTTT 58 8 

: I I : : I I : : I I I : I I : : I I I : 

CVNIRGPGG TDVDLTGNQESMEHGDCQDGATGSQSGPGDSAYVLLGPGPSPETPS 598 

SDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKGHDQLVPV 64 8 

I I I I M III Ml I : : M : 

SPSDAHPGPQS STLGA HTQGVRRDLS PAS AS RS I P I 634 



Qy 64 9 TLLAIAVILAFVMGAVFSGITVYCVCDHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGL 7 08 

II I I I : I I Ihlll II: : I : I : : I : I : I 

Db 635 PLLLACVAAAFALGASVSGLLVSCAC— RRAN RRRSKDIETPGLPRPLSLRSLARL 688 

Qy 709 FGDTQSKDPKP EAILTPLMHNGKLATPGNTAKMLIKADQHHLDLTALPTPESTPTLQ 765 

I II : I II : : I I : : I I I I I I : I I I 

Db 68 9 HGGGPEPPPPPKDGDAAQTPQLYTTFLPPPEGGSPP ELACLPTPETTPELP 739 

Qy 766 QKRKPSRGSREWERNQNLINACTKDMPPMGSPVIPTDLP LRASPSHIPSVWLPITQ 822 

I : I I I I I I I I II I : I I I I I 

Db 74 0 VKHLRASGG-PWEWNQNGNNASEGPGRPRGCSAAGGPAPRVLVRPPPPGCPGQEVEVTTL 7 98 

Qy 823 Q GYQH EYVDQPKMSEVAQMALEDQAATLEYKTIKEHLSSKSP 864 

: I I : : I I I I III 
Db 799 EELLRYLHGPQPPRKGSEPLASAPFTSRPPASEPGAALFVD SSPMP 844 

Qy 865 NHGVNLVENLDSLPP KVPQREASLGPPGASLSQTGLSKRLEMHHSSSYG 913 

I : I I I : I I I : I I : I I 
Db 845 R DCVPPLRLDVPPDGKRAAPSGRPALSAPAPRLGVSG-SRRL 885 

Qy 914 VDYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQSFGRGDNPPPAPQRVDS — IQV 971 

: I I I : I I I I I I 

Db 886 PFPT HRA PPGLLT RVPSGGPSR 907 

Qy 972 HSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDV — PPKPS 1015 

: I I : : I I : III : I III: I I : I : 

Db * 908 YSGGPGRHLLYLGR-PDGHRGRSLKRVDVKSPLSPKPPIiATPPQPA 952 



RESULT 9 
SM1A SCHAM 



ID SM1A_SCHAM STANDARD; PRT; 7 30 AA. 

AC Q26473; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Semaphorin 1A precursor ( Semaphorin-I ) (Sema I) (Fasciclin IV) . 

GN SEMA- 1A OR FAS 4 . 

OS Schistocerca americana (American grasshopper) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Orthopteroidea; Orthoptera; Caelifera; Acridomorpha; 

OC Acridoidea; Acrididae; Cyrtacanthacridinae; Schistocerca. 

OX NCBI__TaxID=7009; 

RN [1] 

RP SEQUENCE FROM N.A. , AND PARTIAL SEQUENCE. 

RX MEDLINE=93040225; PubMed=14 18998 ; 

RA Kolodkin A.L., Matthes D.J., O'Connor T.P., Patel N.H., Admon A., 

RA Bentley D., Goodman C.S.; 

RT "Fasciclin IV: sequence, expression, and function during growth cone 

RT guidance in the grasshopper embryo."; 

RL Neuron 9:831-845(1992). 

CC -!- FUNCTION: PLAY A ROLE IN GROWTH CONES GUIDANCE. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: DYNAMICALLY EXPRESSED ON A SUBSET OF AXON 
CC PATHWAYS IN THE DEVELOPING CNS AND ON CIRCUMFERENTIAL BANDS OF 

CC EPITHELIAL CELLS IN DEVELOPING LIMB BUDS. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 



CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L00709; AAA29808.1; -. 

DR PIR; JH0798; JH0798. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR002165; Plexin_repeat . 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

KW Signal; Developmental protein; Transmembrane; Glycoprotein; 



KW 


Neurogenesis . 








FT 


SIGNAL 


1 


20 


POTENTIAL. 




FT 


CHAIN 


21 


730 


SEMAPHORIN 1A. 




FT 


DOMAIN 


21 


630 


EXTRACELLULAR (POTENTIAL). 


FT 


TRANSMEM 


631 


651 


POTENTIAL. 




FT 


DOMAIN 


652 


730 


CYTOPLASMIC (POTENTIAL) . 


FT 


DOMAIN 


215 


515 


SEMA. 




FT 


DOMAIN 


217 


222 


POLY-PHE. 




FT 


CARBOHYD 


44 


44 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


71 


71 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


163 


163 


N-LINKED (GLCNAC. . 


.) ( POTENTIAL) 


FT 


CARBOHYD 


2 67 


267 


N-LINKED (GLCNAC. . 


. ) (POTENTIAL) 


FT 


CARBOHYD 


360 


360 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


539 


539 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


SQ 


SEQUENCE 


730 AA; 


81214 


MW; 6D62594 6E7F8E57F 


CRC64 ; 


Query Match 




17. 8^ 


h; Score 971.5; DB 1; 


Length 730; 



Best Local Similarity 34.4%; Pred. No. 4.8e-55; 

Matches 244; Conservative 124; Mismatches 224; Indels 117; Gaps 29; 

Qy 8 LYFTLLHFAGAGFPEDSEPISISHGNYTKQYPVFVGHKPGRNTTQR HRLDIQMI 61 

I : I I I : I I III I II I : : : : 

Db 11 LLWVALH — AAAWVNDVSP KMYVQF GEERVQRFLGNESHKDHFKLL 54 

Qy 62 MIMNGTLYIAARDHI YTVDID — TSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHN 119 

: : I : I I : : I : : I II: : : I I I : I : I I I : I : I I 

Db 55 EKDHNSLLVGARNIVYNISLRDLTEFTEQ RIEWHSSGAHRELCYLKGKSEDDCQN 109 

Qy 12 0 FIKVLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGD EFSGMARCPYDAKHANV 174 

: I : I I I : I I : : I I I I I : I I I : I : II hi I I : I I : 

Db 110 YIRVLAKIDDDRVLICGTNAYKPLCRHYALKD GDYWEKEYEGRGLCPFDPDHNST 165 

Qy 175 7VLFADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYF 234 

I : : : : I : I I I I I I I I I : I I I I II : I I I III : : I I : I : I 

Db 166 AIYSEGQLYSATVADFSGTDPLIYRG PLRTERSDLKQLNAPNFVNTMEYNDFIFF 220 



Qy 



235 FFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFN 294 



I I I I I I I I I I : : I I I : I I I : I I I : : : I I I I I I : I I I II I I II I I I I 

Db 221 FFRETAVEYINCGKAIYSRVARVCKHDKGGPHQFGDR-WTSFLKSRLNCSVPGDYPFYFN 279 

Qy 295 I LQAVT DVI RIN- GRDV VLAT FST P YN S I PGS AVCAYDMLDI AS VFTGRFKEQKS PD 350 

: I : : I : I II I : I : I I I I I I I I I I I : I I I I I I I I : : : 
Db 280 EIQSTSDI I EGNYGGQVEKLI YGVFTTPVNS I GGSAVCAFSMKS I LES FDGPFKEQETMN 339 

Qy 351 STWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPW 410 

II II : I I : I I I I I I I I :: I I : I : I I I I I I I I : I I I 

Db 340 SNWLAVPSLKVPEPRPGQCVND S RTL P DVS W FVKS HTLMDEAVPAFFT RP I 391 

Qy 411 FLRTMVRYRLTKIAVD TAAGP YQNHTWFLGSEKGI I LKFLARI GNS GFLN 4 61 

: I :: I I I I I I I I I I : : I : I : I : : I : : I I II : 

Db 392 LIRISLQYRFTKIAVDQQVRTPDG — KAYDVLFI GTDDGKVI KAL NSASFDSSDTV 445 

Qy 462 DSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRC— ERHG 519 

I I : : I I : I II I : : : : I III : : : I I I : : 

Db 446 DSWI EELQVLP PGVPVKNLYWRMDGDDSKLVWSDDEILAIKLHRCGSDKIT 499 

Qy 52 0 KCKKTCIASRDPYCGWIKEGGACSHL-SPN SRLTFEQDIERGNTDGLGDCHNSFVAL 575 

I : : I :: : I I I I I I : : I I : : I I : I I I : : 

Db 500 NCRE-CVSLQDPYCAWDNVELKCTAVGSPDWSAGKRRFIQNISLGEHKACGGRPQTEIV- 557 

Qy 576 NGHSSSLLPSTTTSDSTA QEGYESRGGMLDWKHLLDS PDSTDPLGAVS SHNHQ 628 

: I : I : I : I : I : I : | : : : I I : : I I 

Db 558 AS PVPTQPTTKSSGDPVHSIHQAEFEPE IDNEIVIGVDDSNVIPNTLAEINHA 610 

Qy 629 DKKGVIRESYLKGHDQLVPV TL-LAI AVI LAFVMGAVFS 666 

I I : : I .: I I : I I I : : : I : I : I I 

Db 611 GSK LPSSQEKLPIYTAETLTIAIVTSCLGALWGFISGFLFS 652 



RESULT 10 
SM1A_DR0ME 

ID SM1A_DR0ME STANDARD; PRT; 771 AA. 

AC Q24322; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 1A precursor ( Semaphorin-I ) (Sema I). 

GN SEMA- 1A OR DSEMA-I OR CG18405. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae; Drosophila. 

OX NCBI__TaxID=7227; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE=94094332; PubMed=8269517 ; 

RA Kolodkin A.L., Matthes D.J., Goodman C.S.; 

RT "The semaphorin genes encode a family of transmembrane and secreted 

RT growth cone guidance molecules."; 

RL Cell 75:1389-1399(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 



RX MEDLINE=20196006; PubMed=10731 132 ; 

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., 

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., 

RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., 

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q. , Chen L.X., 

RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M. , Pfeiffer B.D., 

RA Wan K.H., Doyle C, Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., 

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pf annkoch C, Baldwin D., 

RA Ballew R.M. , Basu A., Baxendale J. , Bayraktaroglu L., Beasley E.M., 

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D . , Bolshakov S., 

RA Borkova D. f Botchan M.R., Bouck J., Brokstein P., Brottier P . , 

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., 

RA Cherry J.M. , Cawley S., Dahlke C, Davenport L.B., Davies P., 

RA de Pablos B., Delcher A. , Deng Z., Mays A. D . , Dew I., Dietz S.M., 

RA Dodson K. , Doup L.E., Downes M. , Dugan-Rocha S., Dunkov B.C., Dunn P., 

RA Durbin K.J., Evangelista C.C., Ferraz C, Ferriera S., Fleischmann W., 

RA Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M. , Glasser K., 

RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M. , 

RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., 

RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C, 

RA Jalali M. , Kalush F. , Karpen G.H., Ke Z. r Kennison J. A., Ketchum K.A. , 

RA Kimmel B.E., Kodira CD., Kraft C. f Kravitz S., Kulp D., Lai Z., 

RA Lasko P., Lei Y., Levitsky A. A. , Li J.H., Li Z., Liang Y . , Lin X., 

RA Liu X., Mattei B., Mcintosh T.C., McLeod M.P., McPherson D., 

RA Merkulov G., Milshina N.V., Mobarry C, Morris J., Moshrefi A., 

RA Mount S.M., Moy M. , Murphy B., Murphy L., Muzny D.M., Nelson D.L., 

RA Nelson D.R., Nelson K.A., Nixon K. r Nusskern D.R., Pacleb J.M., 

RA Palazzolo M. , Pittman G.S., Pan S., Pollard J. , Puri V., Reese M.G., 

RA Reinert K. r Remington K., Saunders R.D.C., Scheeler F. , Shen H., 

RA Shue B.C., Siden-Kiamos I., Simpson M. , Skupski M.P., Smith T., 

RA Spier E., Spradling A.C., Stapleton M. , Strong R. , Sun E . , 

RA Svirskas R. , Tector C, Turner R. , Venter E., Wang A.H., Wang X., 

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., 

RA Williams S.M., Woodage T. , Worley K.C., Wu D., Yang S., Yao Q.A. , 

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M. , Zhang G., Zhao Q. , Zheng L.,- 

RA Zheng X.H., Zhong F.N., Zhong W. , Zhou X., Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; 

RT "The genome sequence of Drosophila melanogaster . " ; 

RL Science 287:2185-2195(2000). 

CC -!- FUNCTION : PLAY A ROLE IN GROWTH CONES GUIDANCE. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: EXPRESSED BY SUBSETS OF NEURONS AND MUSCLES. 

CC -!- DEVELOPMENTAL STAGE: EXPRESSION BEGINS AROUND STAGE 10, PRIMARILY 
CC IN THE DEVELOPING CNS . IN STAGE 16 EMBRYOS, IT IS EXPRESSED AT 

CC HIGHEST LEVELS THROUGHOUT THE CNS, AND WEAK EXPRESSION IS SEEN IN 

CC PORTIONS OF THE PERIPHERAL NERVOUS SYSTEM, MOST CLEARLY IN THE 

CC LATERAL SENSORY CLUSTERS. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



EMBL; L26082; AAA88789.1; 
EMBL; AE003621; AAF52696.1; -. 
FlyBase; FBgn0011259; Sema-la. 
InterPro; IPR003659; Plexin-like. 
InterPro; IPR002165; Plexin_repeat . 
InterPro; IPR001627; Sema. 
Pfam; PF01437; PSI; 1. 
Pfam; PF01403; Sema; 1. 
SMART; SM00423; PSI; 1. 



SMART; SM00630; Sema; 1. 






Signal; Developmental protein; Transmembrane; Glycoprotein; 


Neurogenesis . 








SIGNAL 


1 


19 




POTENTIAL . 


CHAIN 


20 


771 




SEMAPHORIN 1A. 


DOMAIN 


21 


608 




EXTRACELLULAR (POTENTIAL) . 


TRANSMEM 


609 


629 




POTENTIAL. 


DOMAIN 


630 


771 




CYTOPLASMIC (POTENTIAL) . 


DOMAIN 


210 


519 




SEMA. 


DOMAIN 


689 


694 




POLY- PRO. 


CARBOHYD 


41 


41 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


68 


68 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


138 


138 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


158 


158 




N-LINKED (GLCNAC. . . ) (POTENTIAL) 


CARBOHYD 


262 


262 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


355 


355 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


SEQUENCE 


771 AA; 


86935 MW; 


752E7DD516F32DA5 CRC64; 


Query Match 




17. 


1%; 


Score 932.5; DB 1; Length 771; 


Best Local 


Similarity 


35. 


0%; 


Pred. No. 1.7e-52; 



Matches 219; Conservative 106; Mismatches 213; Indels 87; Gaps 



20; 



Qy 

Db 



67 TLYIAARDHIYTVDIDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIKVLLK 126 

: I I M : : : : I : : : : I I I : I I : I I I : : I I : I : : : : 

57 SLLIGARNTVFNLSI HDLVEQQRLWTSPEDDTKMCLVKGKDEEACQNYIRIMW 111 



QY 
Db 

Qy 

Db 



127 KNDDALFVCGTNAFNPSCRNYKMD TLEPFGDEFSGMARCPYDAKHANVALFADGKL 182 

: I I I I I I I : I I I I : III : | | I | | I : I : : : I I : I 

112 PSPGRLFVCGTNSFRPMCNTYIISDSNYTLEA TKNGQAVCPYDPRHNSTSVLADNEL 168 

183 YSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYIYFFFREIAVE 242 

I I II I I I : II I I : I : : I I I III: I I : : I I I I I I I I I 

169 YSGTVADFSGSDPIIYRE PLQTEQYDSLSLNAPNFVSSFTQGDFVYFFFRETAVE 223 



Qy 



Db 



243 YNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTDV 302 

: I I :: I I I : II I I I I I : I I I I I I : I I I II : I I I I I I I : I : : : : 

224 FINCGKAIYSRVARVCKWDKGGPHR-FRNRWTSFLKSRLNCSIPGDYPFYFNEIQSASNL 282 



Qy 

Db 

Qy 

Db 



303 IR INGRDVVLATFSTPYNS I PGSAVCAYDMLDIASVFTGRFKEQKS PDSTWT PVPD 358 

: : : I : I I I I I I I I I I I I : : I I I I I : I I I I : I I I I : 

283 VEGQYGSMSSKLIYGVFNTPSNSIPGSAVCAFALQDIADTFEGQFKEQTGINSNWLPVNN 342 

359 ERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRY 418 

: I I I I I I I : I I I I I I I I I I I I I I I I I : I I : I I I 

343 AKVPDPRPGSC HNDSRALPDPTLNFIKTHSLMDENVPAFFSQPILVRTSTIY 394 



Qy 



419 RLTKIAVD TAAGPYQNHTWFLGSEKGIILKFL-ARIGNSGFLNDSLFLEEMSVYN 473 



Db 395 R FT Q I AVDAQ I KT P GG — KT YDVI FVGTDHGKI I KSVNAESADSADKVTS WI EEI DVLT 452 

Qy 474 S EKCS YDGVEDKRIM- GMQLDRAS S S L Y VAFSTCVIKVPLGRC — ERHGKCKKT 524 

: : : I : I I I : I : : I : : I I I : : I : 

Db 453 KS EPIRNLEIVRTMQYDQPKDGSYDDGKLIIVTDSQWAIQLHRCHNDKITSCSE- 507 

Qy 525 CI ASRDPYCGWI KEGGAC- SHLS PN — SRLTFEQDI ERGNTDGLGDCHNS FVALNGHS S S 581 

I : I : I I I I I I I I II : I I I : : I I - 

Db 508 CVALQDPYCAWDKIAGKCRSHGAPRWLEENYFYQNVATGQ HAAC 551 

Qy 582 LLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKG 641 

: I : I I II I II I II I I 

Db 552 P S GKINS KDANAGEQKGFRNDM DLLDS RRQSKDQEIIDNIDKN 594 

Qy 642 HDQLVPVTLLAIAVI LAFVMGAVFS 666 

: : : : : : | : | : : I I 

Db 595 FEDI INAQYTVETLVMAVLAGS I FS 619 



RESULT 11 
SM3A RAT 



ID SM3A_RAT STANDARD; PRT; 772 AA. 

AC Q63548; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 3A precursor (Semaphorin III) (Sema III) . 

GN SEMA3A. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBIJTaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; TISSUE=Brain; 

RX MEDLINE=97073089; PubMed=8915837 ; 

RA Giger R. J. , Wolfer D.P., De Wit G.M.J., Verhaagen J. ; 

RT "Anatomy of rat semaphorin III/collapsin-1 mRNA expression and 

RT relationship to developing nerve tracts during neuroembryogenesis . " ; 

RL J. Comp. Neurol. 375:378-392(1996). 

CC -!- FUNCTION: MAY BE INVOLVED IN GUIDING GROWING AXONS TOWARDS THEIR 

CC TARGETS BY FORMING A MOLECULAR BOUNDARY THAT INSTRUCTS AXONS TO 

CC ENGAGE IN THE FORMATION OF SPECIFIC NERVE TRACTS. BINDS TO 

CC NEUROPILIN. 

CC -!- SUBCELLULAR LOCATION: Secreted (By similarity). 

CC -!- DEVELOPMENTAL STAGE: AT E-ll, EXPRESSION WAS RESTRICTED TO THE 

CC OLFACTORY PIT, THE BASAL AND ROSTRAL SURFACE OF THE TELENCEPHALIC 

CC VESICLE, THE EYE ANLAGE, THE EPITHELIUM OF RATHKE 1 S POUCH, AND 

CC SOMITES. AT LATER DEVELOPMENTAL STAGES, IT WAS WIDELY DISTRIBUTED 

CC IN NEURONAL AS WELL AS IN MESENCHYMAL AND EPITHELIAL STRUCTURES 

CC OUTSIDE THE NERVOUS SYSTEM. AFTER BIRTH, MESENCHYMAL LEVELS 

CC DECREASED RAPIDLY AND EXPRESSION BECAME RESTRICTED TO SPECIFIC 

CC SETS OF NEURONS IN THE CNS . IN THE MATURE CNS, IT IS DETECTABLE IN 

CC MITRAL CELLS, NEURONS OF THE ACCESSORY BULB AND CEREBRAL CORTEX, 

CC CEREBELLAR PURKINJE CELLS, AS WELL AS A SUBSET OF CRANIAL AND 

CC SPINAL MOTONEURONS . 



CC -!- DOMAIN : Strong binding to neuropilin is mediated by the carboxy 
CC third of the protein. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X95286; CAA64607.1; -. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR003659; Plexin-like . 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Signal; Immunoglobulin domain; Multigene family; Neurogenesis; 

KW Developmental protein; Glycoprotein. 



FT 


SIGNAL 


1 


20 


POTENTIAL. 


FT 


CHAIN 


21 


772 


SEMAPHORIN 3A. 


FT 


DOMAIN 


240 


538 


SEMA. 


FT 


DOMAIN 


577 


665 


IG-LIKE C2-TYPE. 


FT 


DOMAIN 


728 


770 


ARG/LYS-RICH (BASIC) . 


FT 


DISULFID 


650 


723 


BY SIMILARITY. 


FT 


CARBOHYD 


53 


53 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


125 


125 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


591 


591 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


772 AA; 


88808 


MW; 240907812FF9F2D2 CRC64; 


Query Match 




15. 7 5 


h; Score 858; DB 1; Length 772; 



Best Local Similarity 32.3%; Pred. No. l.le-47; 

Matches 209; Conservative 104; Mismatches 251; Indels 84; Gaps 20; 

Qy 47 GRNTTQRHRLDIQMIMIMNGT LYIAARDHIYTVDIDT 83 

|:| | :| : :: | ||: |:|||:: :: 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFNL — 82 

Qy 84 SHTEEIYCSKKLTWKSRQADVDTCRMKGKH-KDECHNFIKVLLKKNDDALFVCGTNAFNP 142 

I : I : I II: II MINIMI I : I I I I I : I 

Db 83 VNIKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHP 139 

Qy 143 SCR NYKMDTLEPFGDEF — S GMARC P YDAKHAN VAL FADGKL YS AT VT D FLAI DA 195 

I :: I : I : I : I I I I : I I I : I I I I I I : I 

Db 140 ICTYIEVGHHPEDNIFKLQDSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 

Qy 196 VIYRSLGESPTLRTVKHDSKWLKEPYFVQA VDYGDYIYFFFREIAVEYNTMGK 24 8 

I : I : I I : I I : I I I : I I : I I : I I : I I I I I I I : : II 

Db 200 AIFRTLGHHHPIRTEQHDSRWLNDPRFISAHLIPESDNPEDDKVYFFFRENAIDGEHSGK 259 



Qy 24 9 WF P RVAQVC KN DMG G S Q RVLE KQWT S FL KARLN C S VP G DSHFYFNILQAVTDVIR 304 

I : I : I I I I I I II : I I : I I I I I I I I I I I I : II =11 
Db 260 ATHARIGQICKNDFGG-HRSLVNKWTTFLKARLICSVPGPNGIDTHF DELQDVFL 313 

Qy 305 INGRD WIATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

: I : I : I I : I I I I I I I I I I : III: : I : I I I 

Db 314 MNSKDPKNPIVYGVFTTSSNIFKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPY-QGR 372 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I : : :: : I I | : | :: | | | I I I I I :: I I I : 
Db 373 VPYPRPGTCP-SKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPINNRPIMIKTDVNYQF 431 

Qy 421 TKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKCSYD 480 

|:||| : | : | : | : : | : | | : : : : : I I I I : I : 
Db 432 TQ I WD RVDAE D GQ Y D VM F I GT D VGT VL KWS VP KET WH D L E E VL L E EMT VF R 484 

Qy 481 GVEDKRIMGMQLDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCGWIKEGG 54 0 

I I I : I II: : I : : I I I I : : I I I : I I I I I I : I 

Db 485 — EPTTISAMELSTKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAW — DGS 540 

Qy 541 ACSHLSPNS-RLTFEQDIERGNTDGLGDC HNSFVALNGHS — SSLLPSTTTSDST 592 

: I I 1:111111111 I : : : I I I : : I : 

Db 541 SCSRYFPTAKRRTRRQDIRNG — DPLTHCSDLQHHDNH HGHSLEERIIYGVENSSTF 595 

Qy 593 AQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLK 640 

: : I : : : I : : I : : I I : : : I : : I I 

Db 596 LECSPKSQRALVYWQFQRRNEDRKEEI-RVGDHIIRTEQGLLLRSLQK 642 



RESULT 12 




SM3A_ 


MOUSE 




ID 


SM3A MOUSE STANDARD; PRT; 772 AA. 




AC 


008665; Q62180; Q62215; 




DT 


30-MAY-2000 (Rel. 39, Created) 




DT 


30-MAY-2000 (Rel. 39, Last sequence update) 




DT 


15-MAR-2004 (Rel. 43, Last annotation update) 




DE 


Semaphorin 3A precursor (Semaphorin III) (Sema III) 


(Semaphorin D) 


DE 


(Sema D) . 




GN 


SEMA3A OR SEMAD OR SEMD. 




OS 


Mus mus cuius (Mouse) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebra ta; 


Euteleostomi; 


OC 


Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; 


Murinae; Mus. 


OX 


NCBI TaxID=10090; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=NMRI; TISSUE^Embryo; 




RX 


MEDLINE=952 67431; PubMed=774 8561 ; 




RA 


Pueschel A.W., Adams R.H., Betz H.; 




RT 


"Murine semaphorin D/collapsin is a member of a diverse gene family 


RT 


and creates domains inhibitory for axonal extension. 


r 


RL 


Neuron 14:941-948(1995). 




RN 


[2] 




RP 


SEQUENCE FROM N.A. 




RX 


MEDLINE=97 470885; PubMed=933134 5; 




RA 


Taniguchi M. , Yuasa S., Fujisawa H., Naruse I., Saga 


S . , Mishina M. , 


RA 


Yagi T . ; 





RT "Disruption of semaphorin III/D gene causes severe abnormality in 

RT peripheral nerve projection."; 

RL Neuron 19:519-530(1997). 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Kimura T., Fishman M.C.; 

RT "cDNA sequence of mouse collapsin/semaphorin III."; 

RL Submitted (FEB-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain ; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L . , Shenmen CM., Schuler G.D., 

RA Altschul S.F. f Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R. F. , Jordan H . , Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. r Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X. , Gibbs R.A. , 

RA Fahey J. , Helton E. f Ketteman M. , Madan A. , Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J. , Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [5] 

RP SEQUENCE OF 107-772 FROM N.A. 

RC TISSUE-Fetal brain; 

RX MEDLINE=95267432; PubMed=774 8562 ; 

RA Messersmith E.K., Leonardo E.D., Shatz C.J., Tessier-Lavigne M. , 

RA Goodman C.S., Kolodkin A.L.; 

RT "Semaphorin III can function as a selective chemorepellent to pattern 

RT sensory projections in the spinal cord."; 

RL Neuron 14:949-959(1995). 

CC -!- FUNCTION: Play a role in growth cones guidance. May function to 
CC pattern sensory projections by selectively repelling axons that 

CC normally terminate dorsally. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- DEVELOPMENTAL STAGE: EXPRESSED EARLY IN EMBRYONIC DEVELOPMENT 
CC (Ell) IN DISTINCT REGIONS OF THE NEUROECTODERM AND MESODERM. 

CC EXPRESSION BECAME MORE EXTENSIVE AT LATER STAGES. 

CC -!- DOMAIN: Strong binding to neuropilin is mediated by the carboxy 
CC third of the protein. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 



CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X85993; CAA59985.1; 

DR EMBL; D85028; BAA19773.1; 

DR EMBL; L41541; AAL77611.1; -. 

DR EMBL; BC057588; AAH57588.1; 

DR EMBL; L40484; AAA73934.1; -. 

DR PIR; 148747; 148747. 

DR PIR; 158169; 158169. 

DR MGD; MGI: 107558; Sema3a. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Signal; Immunoglobulin domain; Multigene family; Neurogenesis ; 

KW Developmental protein; Glycoprotein. 



FT 


SIGNAL 


1 


20 


POTENTIAL. 




FT 


CHAIN 


21 


772 


SEMAPHORIN 3A. 




FT 


DOMAIN 


240 


538 


SEMA. 




FT 


DOMAIN 


579 


665 


IG-LIKE C2-TYPE. 




FT 


DOMAIN 


728 


770 


ARG/LYS-RICH (BASIC) . 




FT 


DISULFID 


650 


723 


BY SIMILARITY. 




FT 


CARBOHYD 


53 


53 


N-LINKED ( GLCNAC . . .) 


(POTENTIAL) 


FT 


CARBOHYD 


125 


125 


N-LINKED (GLCNAC. . .) 


(POTENTIAL) 


FT 


CARBOHYD 


591 


591 


N-LINKED (GLCNAC. . .) 


( POTENTIAL) 


FT 


CONFLICT 


193 


193 


D -> N (IN REF. 5) ■ 




FT 


CONFLICT 


207 


207 


H '-> D (IN REF. 1) . 




FT 


CONFLICT 


253 


253 


D -> G (IN REF. 1) . 




FT 


CONFLICT 


352 


352 


F -> L (IN REF. 5) . 




FT 


CONFLICT 


403 


403 


A -> G (IN REF. 1) . 




FT 


CONFLICT 


571 


572 


QH -> ED (IN REF. 1) . 




FT 


CONFLICT 


616 


620 


EDRKE -> RRSKR (IN REF. 


1) . 


FT 


CONFLICT 


623 


623 


R -> K (IN REF. 5) . 




SQ 


SEQUENCE 


772 AA; 


88799 


MW; E89A08528B10AEC3 CRC64; 



Query Match 15.7%; Score 853; DB 1; Length 772; 

Best Local Similarity 32.2%; Pred. No. 2.3e-47; 

Matches 208; Conservative 102; Mismatches 257; Indels 78; Gaps 18; 

Qy 47 GRNTTQRHRLDIQMIMIMNGT LYIAARDHIYTVDIDT 8 3 

|:| | :| : :: | ||: |:|||:: :: 

Db 25 GKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFNL— 82 



Qy 84 SHTEEIYCSKKLTWKSRQADVDTCRMKGKH-KDECHNFIKVLLKKNDDALFVCGTNAFNP 142 

I : | : | II: II I I I I I I I I I I : I I I I I : I 

Db 83 VNIKDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVXEAYNQTHLYACGTGAFHP 139 

Qy 143 SCR NYKMDTLEPFGDEF— SGMARCPYDAKHANVALFADGKLYSATVTDFLAIDA 195 



I :: I : I : I : I I I I : I I I : I I I I I I : I 

Db 140 ICTYIEVGHHPEDNIFKLQDSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDF 199 

Qy 196 VIYRSLGESPTLRTVKHDSKWLKEPYFVQA VDYGDYIYFFFREIAVEYNTMGK 248 

I : I : I I : I I : I I I : I I : I I : I I : I I I I I I I : : I I 

Db 200 AIFRTLGHHHPIRTEQHDSRWLNDPRFISAHLIPESDNPEDDKVYFFFRENAIDGEHSGK 259 

Qy 249 WF P RVAQ VC KN DMGG S Q RVL E KQWT S F L KARLN C S VP G DSHFYFNILQAVTDVIR 304 

I : I : I I I I I I II : I I : I I I I I I I I I I I I : I I : I I 

Db * 260 ATHARIGQICKNDFGG-HRSLVNKWTTFLKARLICSVPGPNGIDTHF DELQDVFL 313 

Qy 305 INGRD WLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVPDER 360 

: I : I : I I : I I I I I I I I I I : III: : I : II I 

Db 314 MNSKDPKNPIVYGVFTTSSNIFKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPY-QGR 372 

Qy 361 VPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVRYRL 420 

I I I I I I I I : : : : : | | | : I : : I I I I I I I I : : I I I : 
Db 373 VPYPRPGTCP-SKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPINNRPIMIKTDVNYQF 431 

Qy 421 TKI AVDTAAGP YQNHT WFLGS EKGI I LKFLARI GNSGFLNDSLFLEEMSVYNS EKCS YD 480 

|:| || : |:|:|:: | :|| :: : : : ||||:|: 
Db 432 TQIWDRVDAEDGQYDVMFIGTDVGTVLKWSVPKETWHDLEEVLLEEMTVFR 4 84 

Qy 481 GVEDKRIMGMQLDRAS S S L YVAFSTCVI KVPLGRCERHGKCKKTCI AS RDP YCGWI KEGG 54 0 

I I I : I II: : I : : I I I I : : M I : I I I I I I : I 

Db 485 — EPTTISAMELSTKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAW— DGS 54 0 

Qy 541 ACSHLSPNS-RLTFEQDIERGNTDGLGDCHNSFVALNGHSSSL LPSTTTSDSTAQE 595 

: II I : I I I I I I I I I : I I II : I : : 

Db 541 SCSRYFPTAKRRTRRQDIRNG— DPLTHCSDLQHHDNHHGPSLEERIIYGVENSSTFLEC 598 

Qy 596 GYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RESYLK 640 

: I : : : | : : I : : : I : : : | : : I I 

Db 599 SPKSQRALVYWQFQRRNEDRKEEI-RMGDHIIRTEQGLLLRSLQK 642 



RESULT 13 
SM3A_HUMAN 

ID SM3A_HUMAN STANDARD; PRT; 771 AA. 

AC Q14563; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Semaphorin 3A precursor ( Semaphorin III) (Sema III). 

GN SEMA3A. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Fetal brain; 

RX MEDLINE-94094332; PubMed=8269517 ; 

RA Kolodkin A.L., Matthes D.J., Goodman C.S.; 

RT "The semaphorin genes encode a family of transmembrane and secreted 

RT growth cone guidance molecules."; 

RL Cell 75:1389-1399(1993). 



RN [2] 

RP SEQUENCE OF 1-37 FROM N.A. 

RA Woessner J., Minx P., Hinds K. , Strowmatt C. ; 

RL Submitted (MAR-1998) to the EMBL/GenBank/DDB J databases. 

RN [3] 

RP SEQUENCE OF 39-182 FROM N.A. 

RA Rohlfing T., Tin-Wollam A.M. , Duckels G.; 

RL Submitted (JUN-1998) to the EMBL/GenBank/DDB J databases. 

CC -!- FUNCTION: INDUCES THE COLLAPSE AND PARALYSIS OF NEURONAL GROWTH 

CC CONES. COULD SERVE AS A LIGAND THAT GUIDES SPECIFIC GROWTH CONES 

CC BY A MOTILITY- INHIBITING MECHANISM. BINDS TO THE COMPLEX 

CC NEUROPILIN-l/PLEXIN-1 (BY SIMILARITY) . 

CC -!- SUBCELLULAR LOCATION: Secreted (By similarity). 

CC -!- DOMAIN: Strong binding to neuropilin is mediated by the carboxy 
CC third of the protein. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L26081; AAA65938.1; -. 

DR EMBL; AC004451; -; NOT_ANNOTATED_CDS . 

DR EMBL; AC004848; AAC78622.1; -. 

DR PIR; D49423; D49423. 

DR Genew; HGNC: 10723; SEMA3A. 

DR MIM; 603961; -. 

DR GO; GO: 0005576; C : extracellular ; TAS . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM004 09; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Signal; Immunoglobulin domain; Multigene family; Neurogenesis; 

KW Developmental protein; Glycoprotein. 

FT SIGNAL 1 20 POTENTIAL. 

FT CHAIN 21 771 SEMAPHORIN 3A. 

FT DOMAIN 240 538 SEMA. 

FT DOMAIN 580 664 IG-LIKE C2-TYPE. 

FT DOMAIN 727 769 ARG/LYS-RICH (BASIC). 

FT DISULFID 649 722 BY SIMILARITY. 

FT CARBOHYD 53 53 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 125 125 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 590 590 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 771 AA; 88889 MW; 9985F8D3EAED8456 CRC64; 



Query Match 15.6%; Score 852.5; DB 1; Length 771; 

Best Local Similarity 32.1%; Pred. No. 2.5e-47; 

Matches 208; Conservative 106; Mismatches 250; Indels 83; Gaps 20; 

Qy 44 HKPGRNTTQRHRLDIQMIMIMNGT LYIAARDHIYTVD 8 0 

:: |:| | :| : :: I I I : | : | I I : : I 

Db 22 YQNGKNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFD 81 

Qy 81 IDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKH-KDECHNFIKVLLKKNDDALFVCGTNA 139 

: | : | : I I I : I I I I I I I II I I I : I II I 

Db 82 L WIKDFQKIWPVSYTRRDECKWAGKDILKECANFIKVLKAYNQTHLYACGTGA 136 

Qy 140 FNPSCR NYKMDTLEPFGDEF— SGMARCPYD7VKHANVALFADGKLYSATVTDFLA 192 

| : | I : : I : : : I : I I I I : I II : I I I I I I : 

Db 137 FHPICTYIEIGHHPEDNIFKLENSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMG 196 

Qy 193 IDAVIYRSLGESPTLRTVKHDSKWLKEPYFVQA VDYGDYIYFFFREIAVEYNT 245 

I I : I : I I : I I : I I I : I I : I I : I I : I I I I I I I : : 

Db 197 RDFAIFRTLGHHHPIRTEQHDSRWLNDPKFISAHLISESDNPEDDKVYFFFRENAIDGEH 256 

Qy 246 MGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPG DSHFYFNILQAVTD 301 

|| I : I : I I M I I I I : I I : I I I I I I I I I I I I : I I : I 

Db 257 SGKATHARIGQICKNDFGG-HRSLVNKWTTFLKARLICSVPGPNGIDTHF DELQD 310 

Qy 302 VIRINGRD WLATFSTPYNSIPGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPVP 357 

I : | : I || I : I I I I I I I I I I : III: : I : II 

Db 311 VFLMNFKDP KN PWYGVFTT S SNI FKGSAVCMYSMS DVRRVFLGP YAHRDGPNYQWVP Y- 369 

Qy 358 DERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMVR 417 

III I I I I I I : : :: : || I : I ::| I I I : III ::| I 
Db 370 QGRVPYPRPGTCP-SKTFGGFDSTKDLPDDVITFARSHPAMYNPVFPMNNRPIVI KTDVN 428 

Qy 418 YRLTKIAVDTAAGPYQNHTWFLGSEKGIILKFLARIGNSGFLNDSLFLEEMSVYNSEKC 477 

| : | : I I I : | : | : | : : | : | | : : : : : : I I I I : I : 
Db 429 YQFTQIWDRVDAEDGQYDVMFIGTDVGTVLKWSIPKETWYDLEEVLLEEMTVFR 484 

Qy 478 SYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGKCKKTCIASRDPYCGWIK 537 

I | | : I II: : I : : I I I I : : I I I : I I I I I I 

Db 485 EPTAISAMELSTKQQQLYIGSTAGVAQLPLHRCDIYGKACAECCLARDPYCAW — 537 

Qy 538 EGGACSHLSPNS-RLTFEQDIERGNTDGLGDC HNSFVALNGHS — SSLLPSTTTSD 590 

: I I I I I : I I I I I I I I I I : : : I I I = I 

Db 538 DGSACSRYFPTAKRRTRRQDIRNG— DPLTHCSDLHHDNH HGHSPEERI IYGVENSS 592 

Qy 591 STAQEGYESRGGMLDWKHLLDSPDSTDPLGAVSSHNHQDKKGVI RES 637 

: : : | : : : I : : : : : I I : : I : : I 

Db 593 TFLECSPKSQRALVYWQFQRRNEERKEEI-RVDDHIIRTDQGLLLRS 638 



RESULT 14 
SM1A_TRICF 

ID SM1A_TRICF STANDARD; PRT; 712 AA. 

AC Q26972; 

DT 30-MAY-2000 (Rel. 39, Created) 
DT 30-MAY-2000 (Rel. 39, Last sequence update) 
DT 16-OCT-2001 (Rel. 40, Last annotation update) 
DE Semaphorin 1A precursor ( Semaphorin-I ) . 



GN SEMA-1A OR TSEMA-I. 

OS Triboliurn confusum (Confused flour beetle) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucu j if ormia; 

OC Tenebrionidae; Triboliurn. 

OX NCBI_TaxID=7071; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Embryo; 

RX MEDLINE= 94094332; PubMed= 8269517; 

RA Kolodkin A.L., Matthes D.J., Goodman C.S.; 

RT "The semaphorin genes encode a family of transmembrane and secreted 

RT growth cone guidance molecules."; 

RL Cell 75:1389-1399(1993). 

CC -!- FUNCTION: PLAY A ROLE IN GROWTH CONES GUIDANCE. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to the semaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 



CC 










DR 


EMBL; L26080; AAA16609.1; -. 




DR 


InterPro; 


IPR003659 


; Plexin- 


like. 


DR 


InterPro ; 


IPR002165; Plexin_ 


repeat . 


DR 


InterPro; 


IPR001627 


; Sema. 




DR 


Pfam; PF01437; PSI; 


1. 




DR 


Pfam; PF01403; Sema 


; 1- 




DR 


SMART; SM00423; PSI 


; 1- 




DR 


SMART ; SM00630; Sema; 1. 




KW 


Signal; Developmental protein; Transmembrane; Glycoproteins- 


KW 


Neurogenesis . 






FT 


SIGNAL 


1 


20 


POTENTIAL. 


FT 


CHAIN 


21 


712 


SEMAPHORIN 1A. 


FT 


DOMAIN 


21 


601 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


602 


622 


POTENTIAL. 


FT 


DOMAIN 


623 


712 


CYTOPLASMIC (POTENTIAL). 


FT 


DOMAIN 


213 


506 


SEMA. 


FT 


DOMAIN 


470 


473 


POLY-VAL . 


FT 


DOMAIN 


612 


616 


POLY-VAL . 


FT 


CARBOHYD 


42 


42 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


69 


69 


N-LINKED (GLCNAC. . .) ( POTENTIAL) 


FT 


CARBOHYD 


161 


161 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


265 


265 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


712 AA; 


79751 MW; C0734FE5B9C09FE3 CRC64; 


Query Match 




15.5%; 


Score 845; DB 1; Length 712; 



Best Local Similarity 28.9%; Pred. No. 6.8e-47; 

Matches 230; Conservative 130; Mismatches 270; Indels 166; Gaps 30; 

Qy 12 LLHFAGAGFPEDSEPI SISHGNYTKQYPVFVGHKPGRNTTQRHRLDIQMIMIMN- 65 

I: I I: I : I: ::| III :::l 



Db 



12 LIALCHAWMPDSSSKLINHFKSVESKSFT 



GNATFPDH FIVLNQ 54 



Qy 66 — GTLYIAARDHIYTVDI-DTSHTEEIYCSKKLTWKSRQADVDTCRMKGKHKDECHNFIK 122 

: : : I : : I : I I I : : : I I I I : I I I I : I I : I : 

Db 55 DETSILVGGRNRVYNLSIFDLSERK GGRIDWPSSDAHGQLCILKGKTDDDCQNYIR 110 

Qy 123 VLLKKNDDALFVCGTNAFNPSCRNYKMDTLEPFGD-EFSGMARCPYDAKHANVALFADGK 181 

: I I : I I I I :: I I I I : : I I : I I I : : I : : : : I : 

Db 111 ILYSSEPGKLVICGTNSYKPLCRTYAFKEGKYLVEKEVEGIGLCPYNPEHNSTSVSYNGQ 17 0 

Qy 182 LYSATVTDFLAI DAVIYRSLGESPTLRTVKHDSKWLKEPYFVQAVDYGDYI YFFFREIAV 241 

1:111111 I : I I I II III I I I : I I I I II : I I : I I I I 

Db 171 LFSATVADFSGGDPLIYRE PQRTELSDLKQLNAPNFVNSVAYGDYI FFFYRETAV 225 

Qy 242 EYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPGDSHFYFNILQAVTD 301 

II II I :: I I I : I II : I I I : :: I I I I I I I I I I I I : I I : I I I : : I : : I 

Db 226 EYMNCGKVIYSRVARVCKDDKGGPHQSRDR-WTSFLKARLNCSIPGEYPFYFDEIQSTSD 284 

Qy 302 VI — RINGRD WLATFSTPYNSI PGSAVCAYDMLDIASVFTGRFKEQKSPDSTWTPV 356 

: : || I : : : I I I : I I I I : I I I I I I I I I I I I : : : I I ! I 

Db 285 IVEGRYNSDDSKKIIYGILTTPVNAIGGSAI CAYQMADILRVFEGSFKHQETINSNWLPV 344 

Qy 357 PDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDEAVPSIFNRPWFLRTMV 416 

I 11:11111 I : II : I I I I I I I I : : I I : : I : I :| : 

Db 345 PQN LVP E P RP GQCVRDS RI LPDKNVNFIKTHSLMED-VPALFGKPVLVRVSL 395 

Qy 417 RYRLTKIAVDTAAGPYQNH — TWFLGSEKGI ILKFIARIGNSGFLNDSLFLEEMSVYNS 474 

: II I I II I | : : : I : : I : I I 
Db 396 QYRFTAITVDPQVKTINNQYLDVLYIGTDDGKVLK 430 

Qy 475 EKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCV IKVP 511 

: : I : I I : I I : I : 

Db 431 AVNIPKRHAKALLYRKYRTSVHPHGAPVKQLKIAPGYGKWWGKDEI 47 8 

Qy 512 LGRCERHGKCKKTCIASRDPYCGWIKEGGACSHLSPNSRLTF-EQDIERGN 561 

I I : I I I : : I I : I I : I : : I I I : I I : 

Db 479 RLANLNHCASKTRC-KDCVELQDPHCAWDAKQNLCVSIDTVTSYRFLIQDWRGDDNKCW 537 

Qy 562 TDGLGDCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESRGGMLDWKHLLDSPDSTDP 618 

II I | | :: | : I I I I I : I I 

Db 538 SPQTDKKTVI KNK PSEVENEIT NSIDEKDL DSSDP 572 

Qy 619 LGAVS S HNHQDKKGVI RE S YLKG HDQLVPVT— LLAI AVI LAFVMGA VFSGITV 670 

I : I I I : : I III I III:: : : I : : I I : : I 

Db 573 LIKTGLDDDSDCDPV-SENSIGGCAVRQQLVIYTAGTLHIVWWSIVGLFSWLYSGLSV 631 

Qy 671 YCV — CDHRRKDVAWQRKE — KELTHSRRGSMS-SVTKLSGLFGDTQSKDPKPEAI LTP 725 

: I : : : : : : I : : : I : : I I I I : I : 

Db 632 FAKFHSDSQYPEAPFIEQHNHLERLSANQTGYLTPRANKAVNLWKVSSSTPRPKKDNLD 691 

Qy 726 LMHNGKLATPGNTAKM 741 

: : : I : I I : 
Db 692 VSKDLNIASDGTLQKI 707 



RESULT 15 
SZ1B BRARE 



ID SZlB_BRARE STANDARD; PRT; 77 8 AA. 

AC Q9W68 6; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Sernaphorin Z1B precursor (Sernaphorin IB) (Sema-ZIB) . 

GN SEMAZ1B OR SEMA3AB. 

OS Brachydanio rerio (Zebrafish) (Danio rerio) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi ; Cyprinif ormes ; 

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7 955; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE-99425174; PubMed=10495275 ; 

RA Roos M. , Schachner M. , Bernhardt R.R.; 

RT "Zebrafish sernaphorin Zlb inhibits growing motor axons in vivo."; 

RL Mech. Dev. 87:103-117(1999). 

CC -!- FUNCTION: MIGHT NORMALLY INFLUENCE THE MIDSEGMENTAL PATHWAY CHOICE 

CC OF THE VENTRALLY EXTENDING MOTOR AXONS BY CONTRIBUTING TO A 

CC REPULSIVE DOMAIN IN THE POSTERIOR SOMITE. 

CC -!- SUBCELLULAR LOCATION: Secreted (By similarity). 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN RHOMBOMERES THREE AND FIVE, AND 

CC IN THE POSTERIOR HALF OF NEWLY FORMED SOMITES WHICH IS AVOIDED BY 

CC VENTRALLY EXTENDING MOTOR AXONS. 

CC -!- SIMILARITY: Belongs to the sernaphorin family. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin- like C2-type domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF083382; AAD28103.1; 

DR ZFIN; ZDB-GENE-991209-6; sema3ab. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR002165; Plexin_repeat . 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF01437; PSI; 1. 

DR Pfam; PF01403; Sema; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00423; PSI; 1. 

DR SMART; SM00630; Sema; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

KW Signal; Immunoglobulin domain; Multigene family; Neurogenesis; 

KW Developmental protein; Glycoprotein. 

FT SIGNAL 1 17 POTENTIAL. 

FT CHAIN 18 77 8 SEMAPHORIN Z1B. 

FT DOMAIN 241 539 SEMA. 

FT DOMAIN 579 668 IG-LIKE C2-TYPE. 



FT DOMAIN 


721 


776 


ARG/LYS-RICH (BASIC) . 


FT DISULFID 


652 


716 


BY SIMILARITY. 




FT CARBOHYD 


54 


54 


N-LINKED (GLCNAC . . 


) ( POTENTIAL) 


FT CARBOHYD 


127 


127 


N-LINKED (GLCNAC. . 




FT CARBOHYD 


593 


593 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


SQ SEQUENCE 


778 AA; 


88904 MW; 4D36F4323AE21895 


CRC64; 


Query Match 




15.5%; 


Score 845; DB 1; 


Length 77 8; 


Best Local Similarity 


34.3%; 


Pred. No. 7.7e-47; 




Matches 194; 


Conservative 


94; Mismatches 204; 


Indels 74; 



Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



Gaps 17; 



66 GTLYIAARDHIYTVD-IDTSHTEEIYCSKKLTWKSRQADVDTCRMKGKH-KDECHNFIKV 123 

I I : : I : II : : : : I : : : : : : I I : I I : II : I I I I I I I 

68 G RL FVGAKDHVL S FN LVD I NMDQQ L 1 SWP S S P S RRDECKWAGKDVQKECANFI KV 122 

124 LLKKNDDALFVCGTNAFNPSCRNYKM DTLEPFGDEF-SGMARCPYDAKHANVALF 177 

I I I : I I I I I : I I : : : I I I : I : I I I I : : 

123 LQPFNQTHLYACGTGAFHPVCAHVEVGKRSEDNTFRLGSSFENGRGKSPYDPKLQTASML 182 

178 ADGKLYSATVTDFLAIDAVIYRSLGESPTLRTVKHDSKWLKEPYFV QAVDY 228 

II : I I : I II: I I : I : I I : : II : I I I : I I : I I I II 

183 IDGELYAGTSADFMGRDFAI FRTLGKHHPIRTEQHDSRWLNDPRFVSVHLIPESDNAED- 241 

229 GDYIYFFFREIAVEYNTMGKWFPRVAQVCKNDMGGSQRVLEKQWTSFLKARLNCSVPG- 287 

I I I I I I I I :: : I I : I : I I I I I I II : I I : I I I I I I I I I I I 

242 -DKIYLFFRENAIDGEQISKATHARIGQLCKNDFGG-HRSLVNKWTTFLKARLVCSVPGL 299 

288 DSHFYFNILQAVTDVIRINGRD WLATFSTPYNSIPGSAVCAYDMLDIASVFT 340 

I : I I : I I :: :| :: I |:| I I I I I I I I I I II 

300 NGIDTHF DELQDVFLMSSKDPKNPIIYAVFTTSSNIFKGSAVCMYSMADIRRVFL 354 

341 GRFKEQKSPDSTWTPVPDERVPKPRPGCCAGSSSLERYATSNEFPDDTLNFIKTHPLMDE 4 00 

I : : I : II .: I I I I I I I I I :::::: I I I I : I : : I I I 
355 GPYAHRDGPNYQWVPFLN-RVPYPRPGTCP-SKTFDGFESTKDFPDDVITFARSHPAMYN 412 

4 01 AVP S I FNRPWFLRTMVRYRLTKI AVDTAAGP YQNHTWFLGS EKGI I LKFLARI GNS GFL 4 60 

I III : : I I I : I : I I I : I : I : I : : I : M : : : 

413 PVFPINNHPI 1 1 KTDVDYQFTQI VVDRVEAEDGQYDVMFI GTDMGTVLKWS I PRGTWHD 472 

4 61 NDSLFLEEMSVYNSEKCSYDGVEDKRIMGMQLDRASSSLYVAFSTCVI KVPLGRCERHGK 520 

: : I I I I : I : I I I : I I I : : I : : I I I I : : I I 

473 LEEVLLEEMTVFR E PTAI TAMEL ST KQQQL YLGS AI GVS QMP LHRC DVYGK 523 



Qy 

Db 

Qy 

Db 



521 CKKTCIASRDPYCGWIKEGGACSHLSPNS-RLTFEQDIERGNTDGLGDCHNSFVALNGHS 579 

I : I I I I I I : I II I : I I III I : 
524 AC AE C C LARD P Y CAW — DGSQCSRYFPTAKRRTRRQDIRNGD 563 

580 SSLLPSTTTSDSTAQEGYESRGGMLD 605 

I I I I : : I : I I 

564 PLTQCSDLQHHDEADGEAGLLD 585 



Search completed: March 24, 2004, 13:14:55 
Job time : 23.5626 sees 



